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ABSTBACT 

Predicting  officer  attrition  is  a  major  difficulty  in 
the  Planning,  Programming,  and  Budgeting  Process.  The 
Marine  Corps  has  historically  used  aggregated  estimates  for 
attrition  rates.  The  purpose  of  this  paper  is  to  demon- 
strate the  application  of  the  James-Stein  and  other 
shrinkage  type  parameter  estimation  schemes  for  the  purpose 
of  generating  more  stable  manpower  loss  rates.  It  has  been 
illustrated  within  this  project  that  improvement  can  be 
attained  by  application  of  the  James-Stein  and  Minimax 
shrinkage  methods  rather  than  the  more  natural  Maximum 
Likelihood  estimation  process.  The  shrinkage  schemes 
employed  herein  offer  powerful  and  useful  methods  for 
generating  attrition  rates. 
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I.  INTBODDCTION 

A.   GENEFAL 

Ibis  paper  is  the  result  of  a  Headquarters  Marine  Corps 
request  to  explore  the  use  of  James-Stein  and  other 
shrinXage  type  parameter  estimation  schemes  for  the  purpose 
of  generating  manpower  loss  rates.  The  very  large  number  of 
cells  within  the  USMC  officer  force  structure  leads  to  tne 
condition  that  empirical  attrition  rates  are  unstable.  This 
problem  is  compounded  by  the  fact  that  the  cell  probabili- 
ties are  saiall.  Further  difficulties  are  present  because 
some  of  the  inventory  cells  are  empty  for  structural  reasons 
while  others  are  empty  by  chance.  Tnerefore,  the  small 
inventory  cells  draw  especial  attention. 

In  order  to  deal  with  this  phenomenon,  variance  stabi- 
lizing and  symmetrizing  transformations  followed  by  versions 
of  the  James-Stein  shrinkage  technique  were  applied  to 
selected  aggregates  of  the  USMC  officer  data.  Tae  applica- 
tion of  these  methods  to  the  summary  data  provided  by  Navy 
Personnel  Research  and  Development  Center  (NPRDC)  has  shown 
much  promise.  It  has  been  illustrated  within  this  project 
that  improvement  can  be  attained  from  application  oi  these 
methods  over  other  estimating  metnods. 

Since  considerable  further  work  is  needed  to  screen  our 
choices,  a  further  purpose  of  this  thesis  is  to  document  a 
number  of  supporting  materials,  including  a  thorougn  discus- 
sion of  the  following: 

1.  Background   of  attrition  and   promotions   for   USMC 
officers. 

2.  Data  types  and  the  immediately  avaiiaide  data  base. 
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3.  Identifying   the   plans    (consumers)    tnat   utilize 
attrition  rates. 

4.  Formulas  and  mataematical   risk  calculations  relating 
to  the  James- Stein  and  empirical  Bayes  estimators. 

Cross  validation  procedures  were  utilized  to  illustrate 
tne  improvement  of  shrinkage  methods  over  the  use  of 
empirical  rates, 

B.   fiACKGfiOQND 

The  forecasts  made  by  Manpower  planning  models  are 
affected  by  three  general  factors:  existing  inventory, 
projected  losses,  and  projected  gains.  In  order  to  project 
the  inventory  into  various  future  time  periods  it  is  neces- 
sary to  use  a  realistic  system  of  flow  rates.  Currently 
those  rates  are  generated  largely  through  arbitrary  manual 
processes.  The  goal  of  this  paper  is  to  examine  ways  to 
develop  objective  and  computerized  loss  rates. 

There  is  an  inherent  confusion  in  terminology  of  losses 
and  loss  rates  in  that  some  are  leavers  from  the  ilarine 
Corps  and  some  are  merely  transitioning  within  the  force 
structure.  Flows  of  Officers  from  particular  cells  are 
cnar act erized  by  Military  Occupational  Specialty  (MOS) , 
length  cf  service  (LCS) ,  grade,  and  perhaps  some  other  cross 
classifying  characteristics.  When  the  flows  go  from  one 
cell  tc  anotner  within  tne  Marine  Corps,  then  the  flow  is 
referred  to  as  a  strength  loss  lor  tne  former  ceil.  An 
officer  moving  from  one  cell  tc  anotner  cell  does  not  mean  a 
loss  from  the  Marine  Corps  to  tne  civilian  lacor  market. 
Changes  such  as  promotion  to  a  higher  pay  grade  or  losses  to 
a  cell  due  to  an  officer  increasing  from  one  length  of 
service  to  tne  next  are  other  forms  of  transition  within  tae 
force  structure. 
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Losses  from  within  the  Marine  Corps  to  the  civilian 
labor  market  are  the  only  losses  treated  in  this  thesis. 
They  can  be  voluntary  or  involuntary.  Voluntary  losses 
occur  when  officers  resign,  retire,  or  are  released  by 
choice.  Involuntary  losses  occur  due  to  discharge,  death, 
disability,  release  from  active  duty,  and  retirement. 
Hetirement  can  oe    voluntary  or  involuntary. 

Authors  who  have  studied  loss  rates  and  how  they  apply 
to  manpower  planning  are  emphasizing  the  importance  of 
understanding  these  rates  in  order  to  conduct  proper 
forecasting.   [Ref.  1]  [Ref.  2] 

A  difficulty  encountered  in  doing  this  project  was  that 
of  obtaining  sufficient  data  points  with  which  to  conduct 
cohort  or  census  manpower  data  analysis  [Ref.  1].  The 
reason  for  this  difficulty  was  the  state  of  the  Marine  Corps 
automated  data  reporting  system  prior  to  1976.  During  tne 
period  1970  to  1970,  tae  Marine  Corps  was  instituting  its 
first  automated  personnel  reporting  system.  Many  proDlems 
were  encountered  during  this  period.  Therefore,  data  from 
this  period  is  generally  unreliable.  Data  at  the  Defense 
Manpower  Data  Center  is  based  on  input  from  Headguarters 
Marine  Corps  and  was  found  to  have  similar  problems.  A 
complicating  factor  during  this  period  was  the  Vietnam 
conflict  and  the  attritions  generated  by  the  conflict. 
These  problems  prevented  an  extended  data  base  from  whicn  to 
draw  from  for  the  purpose  of  this  paper.  The  Marine  Corps 
manpower  reporting  system  was  amended  in  197b  and  by  1977, 
the  system  was  reporting  data  with  over  95  percent  accuracy. 

C.   IBE   HARINE    CORPS   OFFICER   ATTRITION    AND   PROMOTION 
STR DCTOBE 

An  aitalysis  of  a  Marine  Corps  officer  as  he  moves 
through  his  career   reveals  key  periods  in   wnich  an  officer 
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is  most  likely  to  leave  the  service.  These  periods  coincide 
witn  promotion  points  in  the  service.  Under  normal  circum- 
stances, unrestricted  2nd  Lieutenants  are  promoted  to  1st 
Lieutenants  after  24  months  m  grade.  1st  Lieutenants  are 
promoted  to  Captains  in  the  Fifth  year  of  commissioned 
service.  Captains  to  Majors  during  the  tenth  year.  Majors  to 
Lieutenant  Colonels  during  the  sixteenth  year  of  service, 
and  Lieutenant  Colonels  to  Colonels  during  tne  twenty  second 
year  of  service.  An  important  influencing  factor  is  the 
fact  that  retirement  benefits  are  not  achieved  until  the 
twentieth    year    of    total   service. 

1  •      Lieutenants 

Lieutenants  normally  have  a  four  or  five  year 
initial  term  of  service  depending  on  their  source  of  commis- 
sion and  Military  Occupational  Specialty  (MOS)  ,  either  air 
or  ground.  Aviation  Lieutenants  are  some  what  special. 
Only  a  small  numi/er  of  aviation  Lieutenants  leave  the 
service.  This    is      dae      to    their      longer      initial    term      of 

service,  wnich  causes  them  to  reach  the  grade  of  Captain 
before  tne  level  of  natural  attrition  is  more  likely  to 
occur.  Attrition  of  Aviation  Lieutenants  is  primarily  a 
function   of   accidents,    illness,    and    disciplinary    problems. 

2 .      Captains 

There  is  a  much  greater  period  oi  flexibility  for 
Captains  given  that  they  are  regular  officers  or  reserves 
witn  an  extension  of  service  agreement.  Captains  are 
promoted  at  five  years  of  commissioned  service  and  are 
either  passed  over  or  promoted  to  Major  by  their  eleventh 
year  of  service.  Thus  in  a  period  oi  over  six  years,  an 
individual  may  leave  the  service  voluntarily  at  any  point. 
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3 .  Majors 

The  rank  of  Major  is  simpler  to  characterize. 
Pre-DCPMA  (Defense  Officer  Personnel  .Management  Act)  ?lajors 
have  a  guaranteed  length  of  service  of  twenty  years  commis- 
sioned service.  Since  they  are  promoted  during  the  tenth 
and  eleventh  years  of  commissioned  service  and  are  either 
passed  over  or  selected  for  Lieutenant  Colonel  in  tneir 
sixteenth  to  twentieth  years  of  service,  they  have  a  period 
of  six  to  ten  years  in  which  to  make  a  decision  about 
leaving.  Post-DOPMA  Majors  do  not  have  a  guaranteed  length 
of  service  of  twenty  years  but  there  are  provisions  which 
allow  the  retention  of  these  officers  on  active  duty  based 
on  their  skills  and  the  findings  of  a  special  board  at 
Headquarters    Marine   Corps.  By   the   time  officers      make   the 

rank  of  Major  most  have  essentially  committed  themselves  for 
a  tweoty  year  term  of  service.  Reaching  the  twenty  year 
retirement  point  assures  one  a  satisfactory  retirement 
program. 

At  tne  sixteenth  year  of  service,  a  Major  is  first 
eligitle  for  selection  to  Lieutenant  Colonel.  The  selection 
process  may  be  repeated  for  four  years  until  the  officer  is 
either  promoted  or  passed  over  and  retired  at  twenty  years 
of  service.  During  this  later  period  it  is  very  unlikely 
that  any  Major  would  leave  the  service  unless  he  had  prior 
enlisted  service.  That  would  allow  him  to  achieve  a  total 
of    twenty   years  of   service   earlier. 

4 .  Lieutenant  Colonels 

Lieutenant  Colonels  are  usually  promoted  at  sixteen 
years  of  commissioned  service  and  may  continue  in  service 
until  the  twenty  sixth  year  of  service.  Lieutenant  Colonels 
incur  a  three  year  obligation  upon  promotion;  therefore,  at 
any  point  after  their  third  year  in  grade  upon  exceeding 
nineteen  years  of  service  they  may  retire  voluntarily. 
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D.  AITRITIOM  OF  AVIATORS 

The  attrition  rate  of  aviation  officers  was  affected  by 
the  intiation  of  Aviation  Officer  Continuation  Pay  (AOCP)  in 
198  1.  AOCP  provides  a  bonus  per  year  to  aviation  officers 
and  this  in  turn  obligates  continued  service.  The  program 
was  applied  to  all  ranks  provided  the  individual  met  certain 
active  duty  flight  status  requirements.  This  action  by  DOD 
has  had  its  desired  effect  upon  the  retention  of  aviation 
officers.  This  accounts,  in  part,  for  the  reduction  in 
attrition  rates  for  Lieutenant  Colonels  in  1983  and  for 
Majors  and  Captains  in  1982  and  1983, 

E.  TJfiaS 

The  following  terms  will  be  utilized  within  this 
pro  j  ect : 

1.  Aggregation  -  collection  of  historical  officer  data 
over  dist inguisning  characteristics.  For  example, 
the  aggregation  of  all  grades  within  a  Military 
Occupational  Specialty  (MOS) . 

2.  Attrition  -  any  departure  from  the  Marine  Corps  cy  an 
officer. 

3.  Attrition  rate  -  idealized  ratio  of  attritions  to 
inventory. 

4.  End-strength  -  total  number  of  the  Marine  Corps 
officers  at  the  end  of  a  specified  period  of  time. 

5.  Limited  Duty  Officer  (LDC)  -  professionally  qualified 
officers  specifically  designated  for  limited  duty 
within  certain  MOS's. 

6.  Unrestricted  -  all  ofticers  not  designated  as  Limited 
Duty  Officers  (LDO) . 

7.  Regular  -  unrestricted  officers  not  specifically 
assigned  to  a  "Reserve"  status. 
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8.      Heserve  -      unrestricted   officers    with     a    specifically 
assigned   "Reserve"   status. 
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II.  STfiOCTUBE  OF  THE  DATA  BASE 

A.   SOHHABI  DATA  FILE 

The  data  used  ic  this  project  was  obtained  from  a 
summary  data  file  from  the  Commander,  Navy  Personnel 
Research  and  Development  Center  (NPfiDC) .  The  summary  data 
file  was  generated  from  two  source  files:  the  Headquarters 
Master  File  (HMF)  and  the  Quarterly  Statistical  Transaction 
File  (SIATS) .  The  Headquarters  Master  File  (HMF)  was  used 
to  produce  Historical  officer  inventories  as  of  the  begin- 
ning cf  the  fiscal  year.  Inventories  were  generated  for 
each  fiscal  year  from  1977  through  fiscal  year  1983.  ihe 
inventories  were  identified  by  distinct  characteristics. 
These  characteristics  were  Military  Occupational  Specialty 
(MCS),  grade  and  length  of  service. 

The  data  in  hand  does  not  distinguish  between  unres- 
tricted officers  and  limited  duty  officers.  For  that 
reason,  discussion  cf  the  characteristics  that  separate 
these  two  types  cannot  be  used  advantageously  herein.  Such 
discussion  is  included  however  in  anticipation  of  receiving 
a  more  discriminating  data  base  in  the  future. 

Ihe  G*iarterly  Statistical  Transaction  File  was  used  to 
generate  Historical  losses.  Within  tnis  file,  if  the  Type 
Transaction  Code  indicated  a  loss,  tnen  the  Effective  Date 
of  Action  field  would  specify  the  year  and  month  of  the 
loss.  Losses  were  classified  into  eight  categories  for  each 
fiscal  year  1977  tnrcugh  1983.  The  losses  were  furtner 
classified  into  distinct  elements  by  MOS,  grade  and  length 
of  service. 


B.  SOHHABY    DATA    FORflAI 

Ihe  summary  data  file  classified  the  Marine  Corps 
Officer  inventory  into  UO  Military  Occupation  Specialties, 
10  grade  levels,  31  lengths  of  service,  and  8  loss  catego- 
ries. Ihe  data  format  is  defined  by  Tables  I  and  II.  The 
columc  containing  the  letters  A  through  E  refer  to  the 
structural  zero  problem  discussed  in  the  next  section.  See 
the  material  surrounding  the  contents  of  Table  V. 

Throughout  the  remainder  of  this  project,  when  reference 
is  made  to  a  particular  Military  Occupational  Specialty 
(MOi)  ,  the  data  code  reference  from  Table  II  will  be  used 
instead  of  the  actual  MOS.  For  example,  this  project  will 
refer  to  the  Utilities  MOS  as  number  06  and  not  11.  It 
should  also  be  understood  that  the  two  digit  MOS  identifier 
listed  in  Table  II  is  strictly  the  Occupational  Field  iden- 
tifier in  the  USMC  MOS  Manual.  This  results  in  MOS,  for  the 
purposes  of  this  paper,  being  an  aggregation  of  what  is 
usually  understood  to  be  an  MOS  cell. 

C.  SlfiOCTUBAL    AND   SABFLING    ZEBOES 

There  are  two  kinds  of  zero  inventory  cells  that  must  be 
dealt  with: 

1.  Structural  zeroes:  A  cell  wnose  inventory  is  always 
zero  because  certain  grades  and  length  of  service 
combinations  should  never  appear  in  the  Military 
Occupational  Specialty  (MOS) .  For  example,  a 
Lieutenant  Colonel  with  4  years  of  service  in  any  MOS 
or  an  infantry  warrant  officer  in  MOS  03  does  not 
exist. 

2.  Sampling  zeroes:  according  to  the  sample  data,  there 
is  no  officer  in  the  particular  grade  and  length  of 
service  combination  for  a  given  fiscal  year  and 
Military  Occupational  Specialty.  This  is  not  a 
permanent  condition,  but  one  tnat  occurs  by  cnance. 
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COLUMN 

1-2 
3-4 
5-6 
7-8 
9-13 
14-53 


CODE 

0 
1 
-  2 
3 
4 
5 
6 
7 
8 
9 


CODE 

1 

2 
3 
4 
5 
6 
7 
8 


TABLE    I 

DATA    FCEMAT 

DATA 

YEAR    -     (FISCAL)      ( 1  977 .  . .  1  983) 
)UP    (0...3g) 


f<OS    GEOUP     (0, 

GRADE 

LENGTH    OF    SERVICE 

BEGINNING    OF     YEAR 

LOSS    COUNTS     (TYPES     1 8) 


(0.. .30) 

INVENTORY 


:Efi 


GRADE 

WARRANT    OFFICER     (W-1 

CHIEF     WARRANT    OFFIC" 

CHIEF    WARRANT    OFFICER 

CHIEF    WARRANT    OFFICER 

SECOND    LIEUTENANT 

FIRST    LIEUTENANT 

CAPTAIN 

MAJOR 

LIEUTENANT    COLONEL 

COLONEL 


TYPE    LOSS 

VOLUNTARY    RESIGNATION 
VOLUNTARY    RETIREMENT 
INVOLUNTARY    -    DEATH 
INVOLUNTARY    -    DISCHARGE 
INVOLUNTARY    -    DISABILITY 
RELEASE    FROM    ACTIVE    DUTY 
DISABILITY    RETIREMENT 
INVOLUNTARY    RETIREMENT 


CWO-2 

CWO-3 
CHO-4 


For  tne  purpose  of  clarity  in  this  paper,  the  ncnstruc- 
turai  zero  region  will  be  referred  to  as  being  within  the 
Feasible  Region.  This  region  is  defined  by  the  lower  and 
upper  limits  of  the  length  of  service  for  eacn  grade.  Table 
III  identifies  the  miniaium  and  maximum  years  of  commissioned 
service      for      the      Marine      Corps        officer      grades      for      the 
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TABLE   II 
fllLITABY   OCCUPATIONAl    SPECIALTIES     (flOS) 

DAIA 

CODE  M03  CAT  MOS    TIILE 

00  UN  A  UNKNOWN 

0  1  01  A  PERSONNEL  AND  ADMINISTRATION 

02  02  A  INTELLIGENCE 

03  03  C  INFANTRY 
OH  04  A  LOGISTICS 

05  08  A  FIELD  ARTILLERY 

06  11  D  UTILITIES 

07  13  A  ENGINEER,  CONSTRUCTION,  AND  EQUIPMENT 
06  14  D  DRAF1ING,  SURVEYING,  AND  MAPPING 

09  15  D  PRINTING  AND  REPRODUCTION 

10  18  C  TANK  AND  AMPHIBIAN  TRACTOR 

11  21  A  ORDNANCE 

12  23  B  AMMUNITION  AND  EXPLOSIVE  ORDNANCE 

DISPOSAL 

13  25  A  OPERATIONAL  COMMUNICATIONS 

14  26  A  SIGNALS  INTELLIGENCE/GROUND  ELECTRONIC 

WARFARE 

15  28  B  DATA/COMMUNICATIONS  MAINTENANCE 

16  30  A  SUPPLY  ADMINISTRATION  AND  OPERATIONS 

17  31  A  TRANSPORTATION 

18  33  A  FOOD  SERVICE 

IS  34  A  AUDITING,  FINANCE,  AND  ACCOUNTING 

20  35  A  MOTOR  TRANSPORT 

21  40  A  DATA  SYSTEMS 

22  41  B  MARINE  CORPS  EXCHANGE 

23  43  A  PUBLIC  AFFAIRS 

24  44  A  LEGAL  SERVICES 

25  4b  A  TRAINING  AND  AUDIOVISUAL  SUPPORT 

26  55  E  BAND 

27  57  D  NUCLEAR,  BIOLOGICAL,  AND  CHEMICAL 

28  58  A  MILITARY  POLICE  AND  CORRECTIONS 

29  59  B  ELECTRONICS  MAINTENANCE 

30  60  A  60XX 

31  61  A  AIRCRAFT    MAINTENANCE 

32  63  B  AVIONICS 

33  65  B  AVIATION    ORDNANCE 

34  68  B  WEATHER    SERVICE 

35  70  D  AIRFIELD    SERVICES 

36  72  A  AIR    CONTROL,     AIR    SUPPORT,    AND    ANTI-AIR 

WARFARE 

37  73  A  AIR  TRAFFIC  CONTROL 

38  75  C  PILCIS  AND  NAVAL  FLIGHT  OFFICERS 

39  99  E  IDENTIFYING  «0S  AND  REPORTING  MOS 


unrestricted  as  well  as  the   Limited  Duty  Officer.    Recall, 
tne  latter  type  is  not  separated  in  the  current  data. 
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TABLE    III 

fllNIflOH    AMD    HAXIMOH    Y EABS-OF-COMMISSIO NED 
SEflVICE    FOE    OFFICER    GRADES 

Grade  Unrestricted  LDO 

MIN    yes         MAX    YCS  MI N    YCS  MAX    YCS 


WOl 

1 

4 

CW02 

4 

9 

CW03 

8 

13 

CW04 

12 

24 

2  IT 

1 

4 

ILT 

1 

8 

CAPT 

4 

13 

MAJ 

9 

20 

lie  01 

14 

23 

COL 

21 

30 

1  8 

2  12 
5  18 

8  23 


In  order  to  get  the  officer's  length  of  service  one  mast 
add  the  ofticer's  prior  enlistment  period  to  the  officer's 
years  of  commissioned  service.  Since  prospective  warrant 
officers  are  reguired  to  have  a  minimum  of  five  years  of 
service  prior  to  consideration  by  tne  officer  selection 
board,  the  logical  lower  limit  for  length  of  service  for 
warrant  officers  is  five  years.  The  LOS  then  would  include 
the  total  of  this  service  and  the  respective  years  of 
commissioned  service.  Additionally,  warrant  officers  are 
required  to  nave  10  years  of  service  prior  to  consideration 
to    oecome      Limited   Duty    Officers.  This    affects      the    lower 

limit  of  tne  length  of  service  for  Limited  Duty  Officers. 
Tae  logical  lower  limit  for  length  of  service  for  LDO's 
would  be    10   years. 

The  summary  data  provided  for  this  project  did  not 
distinguish  between  unrestricted  and  Limited  Duty  Officers. 
Additionally,  the  length  of  service  was  the  characteristic 
provided  instead  of  the  years  of  commissioned  service  for 
each    of    the   officer    grades.  Table   IV    identifies    the   lower 
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and  upper  limits  of  the  Feasible  Region  of  the  length  of 
service  for  Marine  Corps  officer  grades  utilized  in  this 
project. 


r 

TABLE  I? 

MIIIHOM 

AND  HAXIHOM  LENGTH  OF 
FOB  OFFICER  GRADES 

SERVICE 

Grade 

MIN  LOS 

MAX  LOS 

WOl 

CW02 

CH03 

CW04 

5 

8 

12 

16 

20 
30 
30 
30 

2LT 

1LT 

CAPT 

MAJ 

LTCOL 

COL 

1 
1 

U 

9 

14 

21 

20 
20 
30 
30 
30 
30 

Note  the  maximum  limit  in  the  majority  of  grades  can 
feasitly  be  30  years.  This  was  substantiated  by  the  actual 
data  wherein  the  maximum  length  of  service  for  grades  CW03 
through  Colonel  was  30  years.  The  maximum  length  of  service 
for  hiCI's,  2LI's  and  lLT*s  was  limited  to  20  years  because 
of  the  very  small  number  of  actual  officers  with  21  to  30 
years  of   service    within   these    grades. 

One  method  of  considering  the  size  of  the  feasible 
region  is  to  create  a  multidimensional  array  using  all 
combinations  of  different  characteristics  within  the  data. 
If  all  characteristics  were  considered  for  all  seven  fiscal 
years,  the  size  of  the  multidimensional  matrix  would  be 
7x40x31x10x8-  This  would  represent  7  years,  40  MOS's,  31 
LOS's,         10    grades      and    8      loss    types.  This    would      create 

694,400      different  cells      that    could      be      dedicated    to      loss 
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counts.  However,  to  bring  the  dimensicn  to  a  more  manage- 
able level,  consider  one  year  of  data  for  eacn  loss  type 
having  a  cell  count  oi  40x3  1x10.  Therefore,  for  each  loss 
type  within  a  fiscal  year  there  is  a  total  of  12,400  cells. 
Many  cf  these  cells  are  infeasxble  and  are  designated  those 
wita    structural   zeroes. 

The  region  of  structural  zeroes  is  the  area  outside  of 
the  minimum  and  maximum  length  of  service  for  the  officer 
grades  as  listed  in  Table  IV.  But  it  is  further  articulated 
because  of  additional  constraints  imposed  by  the  various  ilOS 
classifications.  This  is  largely  because  of  the  suitability 
of  certain  MOS*s  being  assigned  to  unrestricted,  warrant  and 
Limited  Duty  Officers.  The  particular  ilOS's  are  identiried 
for  illustration  purposes  by  letter  in  Taisle  II  according  to 
the  general  categorization  listed  in  Table  V.  This  general 
MOS  categorization  specifies  by  category  the  inclusive 
grades  within  certain  aos's,  the  number  of  MOS's  with  the 
specified  grade  structure,  the  number  of  structural  zeroes 
per  flOS,  and  the  total  structural  zeroes  within  each 
category.  The  total  number  of  structural  zeroes  is  6149- 
Therefore,  the  number  of  ceils  within  the  feasible  region  is 
625  1    for   each    loss   type   for    a    given   fiscal    year. 

D.       CENTfiAL    DATA 

The  following  is  an  observation  made  by  the  autnor 
regarding  the  summary  data  utilized  in  this  project.  This 
data,  which  was  provided  by  Navy  Personnel  Research  and 
Development  Center  (NPRDC),  was  classified  as  central  data 
according   to      Bartnolcmew   [ Ref .    1:         p.       25].  Bartholomew 

defines    central   wastage    (attrition)    rates   as   follows: 

1.  The  central  wastage  (attrition)  rate  is  the  number  of 
leavers  during  the  period  who  were  in  this  class  when 
they  left  divided  by  the  average  number  in  this  class 
during    the    period. 
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TABLE    ? 
STRUCTOfiAL    ZERCES   CATEGORIES 

Stru.  Total 

Grades                                Number        Zeroes  Zeroes 

Category        Within    MOS                      of    MOS        per   MOS  per   Cat. 

A                   W01...LIC0L                           23                 129  2967 

B                    WOl... CH04-LD0                      8                 159  1272 

C                    2LT...LIC0L                              3                 202  606 

D                    W01...CW04                                5                 237  1185 

E                   W01.,.CCL                                   1                  119  119 

TCTAL                                                                       40  6149 


A  problem  arose  on  several  occasions  when  the  data  was 
disaggregated  to  a  level  where  the  inventory  was  very  small. 
For  example,  when  examining  the  inventory  in  a  particular 
fiscal  year,  the  inventory  was  zero  ror  a  Length  of  Service 
(i.OS)  and  Military  Occupational  Specialty  (MOS)  combination. 
Examining  the  inventory  in  the  next  fiscal  year  for  the  same 
LOS  and  MOS  combination  also  was  zero.  However,  the  problem 
arises  when  the  number  of  leavers  is  equal  to  or  greater 
than  one.  The  average  of  the  inventories  for  the  two  fiscal 
years  is  zero.  Using  this  result  in  the  estimation  of  the 
attrition  rate  would  be  ambiguous.  The  zero  in  the  denomi- 
nator of  the  ratio  of  leavers  to  average  inventory  results 
in  an  undefined  expression. 

For  tne  purpose  of  removing  tnis  ambiguity  in  the  data, 
the  following  policy  was  adopted  to  define  the  central 
inventory  number  for  the  officer  force  at  disaggregated 
levels;  For  any  cell  or  collection  of  cells 

1.  Let,  t  =  1...6,  refer  to  tne  years  1977. ..1982. 

2.  Let,  Y  (t)  =  Number  of  losses  in  year  t. 

3.  Let,  INV(t)  =  Inventory  in  the  beginning  of  year  t. 
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4.  Let,  N  (t)  =  Maximum  of  Y  (t)  and  the  average  inventory 
in  year  t,  using  the  teginnxng  inventory  in  year  t 
and  t+1  and  computing  their  average; 
(INV(t)  +INV  (t  +  1))  /2. 

5.  Let,  m  =  Sam  cf  Y(t)  divided  by  sum  of  N(t),  (both 
sums  over  t)  represents  the  empirical  central 
attrition  rates  for  the  particular  cell  or  aggregate. 

The  data  bank  that  will  ultimately  be  utilized  in  the 
parent  project  at  NPREC,  of  which  this  project  is  a  subset. 
Will  contain  the  appropriate  inventory  and  attrition  data  so 
that  tne  above  modifications  will  not  be  required.  It  is 
currently  planned,  ty  NPRDC,  to  produce  the  historical 
officer  inventory  as  of  the  beginning  of  a  month  and  the 
losses  will  be  identified  by  the  month  in  which  the  officer 
leaves  the  Marine  Corps.  This  will  provide  the  elements  for 
a  more  accurate  estimation  of  tne  attrition  rate  on  the 
disaggregate  level- 
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III.    COBBENT    SYSTEM 

A.       0)}£fiVI£H 

In  order  to  gain  an  appreciation  for  the  complexity  of 
the  Manpower  Planning  process,  the  author  during  his  experi- 
ence tour  at  Headquarters  Marine  Corps  had  the  opportunity 
to  contact  key  individuals  responsible  for  the  execution  of 
tne  mcdels  currently  utilized  within  the  Manpower  depart- 
ment. Ihe  methodology  used  to  identify  the  current  system 
included  interviews  with  the  action  officers  of  the  Officers 
Plans  Section  (Code  MPP-30) ,  briefings  and  interviews  with 
representatives  from  Allocations  Section  of  Manpower  Control 
Branch  (Code  MPC-20) ,  Officer  Assignment  Branch  (Code  MMOA) , 
Manpower  Plans,  Programs  and  Budget  Section  (Code  MPP-40)  , 
Manpower  Management  Information  Systems  Branch  (Code  MPI) , 
Assistant  Deputy  Chief  of  Staff  for  Manpower  (Code  M) ,  and 
examination  of  existing  manuals.  Requirements  Statement  for 
Officer  Planning  System,  and  Statement  of  Work  for  the 
Development  of  the  Officer  Inventory  and  Manpower  Flow  Data 
Base  System. 

Ihe  goal  of  the  officer  planning  process  is  to  shape  the 
officer  force  structure  over  the  7  planning  years  (current, 
budget,  and  five  Program  Objective  Memorandum  (POM)  years) . 
This  force  structure  is  affected  by  three  influencing 
factors:  existing  inventory,  projected  losses,  and  expected 
gains.  The  Officer  Plans  Section  (Code  MPP-30)  supports  the 
Manpower  Department  at  Headquarters  Marine  Corps  by 
preparing  plans  that  address  each  of  these  factors. 

The  plans  which  input  the  target  inventory  calculations 
are  the  Promotion  Plan,  the  Lateral  Move  and  Directed 
Lateral  Move  Plans,  the  Inventory  Projection,  and  the 
Manpower  Plan. 
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Losses  in  tne  officer  force  are  th.e  result  of  retire- 
ments, resignations,  releases,  discharges  and  other  factors 
(e.g.,  death).  Among  these  factors,  releases  and  aandatory 
(statutory)  retirements  addressed  in  the  Oficer  Inventory 
Control  Plan,  can  te  estimatea  with  a  high  degree  of 
certainty.  The  remaining  loss  factors  are  estimations  based 
on  forecasted  attrition  rates  utilizing  historical  data. 
All  of  the  factors  contributing  to  losses  in  the  officer 
inventory  are  addressed  in  the  Inventory  Projection  Loss 
Plan. 

Gains  in  officer  force  are  tne  result  of  accessions  and 
augmentation.  Accessions  are  identified  in  the  Accession 
Plan  ana  augmentations  are  identified  in  the  Augmentation 
Plan.  The  number  of  field  grade  officers  are  directly 
influenced  by  statutory  limitations  specified  in  Title  10, 
U.S.  Code.  These  limitations  affect  both  of  these  plans. 
In  the  planning  process,  in  order  to  plan  for  an  adequate 
number  of  field  grade  officers  in  the  future,  one  must 
currently  access  a  sufficient  number  of  second  lieutenants. 

The  force  structure  is  also  affected  by  the  mixture  of 
skills  of  the  officer  inventory.  The  plans  used  to  support 
the  required  mix  and  levels  of  skills  are  the  T3S/M0S 
Distribution  Plan,  the  Entry  Level  Training  Plan,  tne 
Lateral  Mover  Plan,  the  Continuation  Plan,  and  the  Promotion 
Plan. 

B.   SCUECES  OF  liFORHATIOH      -  - 

Input  from  the  following  external  sources  provide  infor- 
mation required  by  the  Officer  Plans  Section  to  create  the 
plans  identified  above. 

1.   Statutory  limitations   specified  in   Title  10,    U.S. 
Code. 
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2.  Desired  mventcry  force  structure  as  enumerated  in 
the   Grade    Adjusted   Recapitulation     (GAR). 

3-  FeedfcacJc  data  from  agencies  which  execute  the  plan 
(for  example,    actual    monthly   promotions) . 

4.  Historical  data  from  JUMPS/MMS  on  officer  service 
characteristics  and  any  changes  in  the  officer 
inventory    over    time. 

5.  Current  data  on  the  existing  officer  inventory 
utilizing  the  Officer  Slate  file  and  the  Lineal  List 
file. 

C.       FIOH    OF    INFOfiflATIOM 

The  planning  process  consists  of  three  major  elements 
comprised  of   several    processes.      These    elements    are: 

1.  Preparation  for  planning. 

2.  Plan  development. 

3.  Force  structure  analysis. 

Ihe  preparation  process  includes  development  of  the 
targeted  force  structure  to  include  distribution  of  officers 
by  skill,  sex,  unrestricted/restricted  categories,  and 
grades  based  on  historical  flow  rates  and  UStiC  policy  objec- 
tives. The  reguirements  definition  process  establishes  the 
authorized  number  of  officers  in  eacn  Military  Occupational 
Specialty  (HOS)  and  grade  for  the  five  planning  years  and 
determines  tne  target  force.  The  programmed  requirement  is 
specified  in  the  Numerically  Adjusted  Recapitulation  (NAR) . 

Ihe  planning  process  also  includes  the  determination  of 
officer  attrition  rates.  Currently  the  attrition  rate 
deter Bination  method  is  accomplished  in  conjunction  with  the 
Officer  Promotion  Planning  Process.  Tnis  model  divides  the 
attrition  rates  into  two  categories:  statutory  and  non- 
statutory. The  statutory  attrition  rate  results  from  USMC 
and   LOD  policies   and  is   applied  directly   to  the   officer 
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inventory.  The  non-statutory  rate  is  derived  from  a  time 
series  analysis  of  officer  inventory.  This  rate  is  tased  on 
historiccii  data  and  assumes  ccntinuation  of  present  trends. 
Currently  there  is  no  attempt  to  provide  predictive  rates 
based  on  econometric  techniques.  However,  such  an  approach 
is  being  researched  by  the  Navy  Personnel  Research  and 
Development  Center  for  use  by  the  Manpower  Department. 

The  development  process  includes  projection  of  the 
officer  inventory  and  generation  of  various  plans.  The  goal 
of  the  inventory  projection  process  is  to  provide  a  break- 
down by  month  of  the  number  of  officers  in  each  grade. 
Military  Occupational  Specialty  (MOS) ,  and  other  officer 
attributes  (for  example,  restricted,  years  of  commissioned 
service,  sex,  etc.).  This  breakdown  is  for  each  of  the 
planning  years.  Currently  the  inventory  projection  is 
deteruined  in  one  of  several  ways,  depending  on  the  plan 
being  developed.  The  Promotion  Planning  Process  projection 
is  utilized  if  the  level  of  aggregation  is  adequate  for  the 
plan.  Otherwise,  manual  calculations  are  used  to  aggregate 
from  tne  promotion  planning  process  inventory  projection  or 
to  generate  a  new  projection. 

The  plan  generation  process  is  complicated  by  the  inter- 
dependence of  the  plans  and  the  timing  of  their  production. 
Table  VI  identifies  the  plan  generation  process  including 
each  plan,  the  frequency  of  the  generation  and  update  of 
each  plan,  the  users,  and  the  sources  of  data. 

The  force  structure  analysis  process  involves  verifica- 
tion cf  the  generated  plans  and  evaluation  of  their  quality. 
Comparison  is  made  cf  the  results  of  the  planning  process 
with  the  USMC  objective  force  specified  by  the  Grade 
Adjusted  Recapitulation  (GAR)  and  the  promotion  flew  points 
specified  by  Defense  Officer  Personnel  Management  Act 
(DOPMA)  . 
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TABLE  ?I 
OFFICER  PLAN  GENEBATION  SOMMARI 


PLAN 


S  0  DfiCES 
GENERATED       UPDATED       D5ED    BY  OF    DATA 


Prcmotion    plan        Annual 


Monthly      Prom.    Brd    1,2,3,4, 
MPP-30  10 

RES 
MMPR 


Augmentation 
Plan 

Annual 

Monthly 

MPP-30 

MMOA 

ORB 

1^2,3,4, 

Inventory 
Projection 

Annual 

Annual 

MPP-30 

1,2,3,4, 
5,b,7,8, 

Inventory   Proj. 
Loss    Plan 

Annual 

Annual 

MPP-30 

1,2,7 

Continuation 
Plan 

Annual 

Annual 

MPP-30 

1.2,4,5 

Accession    Plan 

Annual 

Monthly 

MPP-30 
MMRD 

l:i''''' 

Entry   Level 
Training  Plan 

Annual 

Monthly 

MMOA 

TBS 

T 

1,2,4 

TBS/MCS 

Distribution 

Plan 

Annual 

Monthly 

MMOA 
IBS 

T 

1,2,4 

Lateral    Move 
Plan 

Annual 

Monthly 

MPP-30 
MMOA 

1,2,4,5 

Manpower   Plan 

Annual 

Monthly 

MPP-30 
MPP-40 

1,2,3,4, 
b,6,8,9 

Officer    control 
Plan 

Annual 

Monthly 

MPP-30 

1,2,4,5 

Source  of   Data  Codes 


1.  HMF 

2.  Attrition  Rates 

3.  DOPMA 

4.  GAR 

5.  Budget 

b.  Candidate  List 


7.  JUflPS/MMS 

8.  WO  Accessions 

9.  Retirement/Separation 

10.  Lineal  List 

11.  Oificer  Slate  File 


I 
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D.   PLANS 

Promotion  plan  is  generated  by  tne  use  of  the  Promotion 
Planning  Process  model.  The  model's  output  is  a  statistical 
projection  of  the  officer  inventory  over  a  period  of  up  to 
10  years.  It  also  provides  several  built  in  promotion 
strategies. 

Augmentation  Plan  is  generated  to  provide  the  Officer 
fietention  Board  (ORB)  with  information  on  the  vacancies 
available  in  each  fiscal  year  of  the  planning  process 
regarding  officers  in  a  category  (i.e..  Aviation,  Ground, 
and  Naval  Flight  Officer)  with  a  certain  number  of  years  of 
commissioned  service. 

Inventory  Projection  Plan  is  generated  for  a  five  year 
projection  and  is  specified  for  each  year  by  grade,  years  of 
commissioned  service,  and  category  (unrestricted  or  Limited 
Duty  Officer  (LDO)) . 

Inventory  Projection  Loss  Plan  contains  data  on  expected 
losses  from  the  inventory  projection  process.  The  plan  is 
utilized  to  indicate  wnich  year  of  comoiissioned  service  the 
losses   are  most  likely  to  occur. 

Officer  Inventory  Control  Plan  provides  information  on 
the  number  of  reserve  officers  whose  contracts  expires 
daring  the  fiscal  year.  It  is  influenced  by  the 
Augmentation  Plan  and  provides  guidance  on  the  number  of  the 
officers  who  may  be  retained. 

Continuation  Plan  specifies  the  number  of  officers  tnat 
may  be  granted  an  extension  of  active  duty  depending  upon 
tneir  Military  Occupational  Specialty  (MOS) . 

Accession  Plan  identifies  the  desired  number  of  incoming 
potential  officers  and  receives  as  input  the  number  of 
vacancies  for  each  year  of  the  planning  process.  Since 
accessions  into  the  USMC  are  by  officer  candidates  who  have 
usually  signed  contracts  several   years  prior  to  graduation. 
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the  pcoi  of  accessions  for  the  first  turee  years  are  gener- 
ally well  known.  The  goals  tor  the  remaining  planning  years 
are  established  on  projected  officer  losses  and  on  budgeted 
end  strengths. 

Entry  Level  Training  Plan  and  the  T3S/M0S  Distribution 
Plan  ensure  that  the  entry  level  officers  have  the  mix  of 
Military  Occupational  Specialties  that  is  needed  to  fully 
support  the  force  structure. 

Lateral  Hove  Plan  provides  information  on  the  shortage 
or  overage  in  each  Military  Occupational  Specialty  by 
year-group. 

Manpower  Plan  is  a  summary  of  gains  and  losses  in  each 
grade . 
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17.  HAEIIE  COfiPS  OFFICEB  ATTRITION  MODEL 

A-   DEFIHITION 

A  delinition  for  a  manpower  model  offered  i>y  Bartholomew 
is: 


A  manpower  model  is  a  matnematical  description  of  how 
cnaoge  taxes  place  in  the  system.  First  of  all  this 
requires  the  specification  of  any  constraints  under 
Wiiich  the  system  operates  ....  Secondly,  a  model 
mast  specify  the  mechanism  which  generates  flows.  Some 
flows-  such  as  promotion  or  demotion,  are  under  the 
direct  control  of  management  ....  Other  flows,  such 
as  voluntary  wastage,  are  not  under  direct  management 
control  and  assumptions  about  their  future  levels  is 
likely  to  be  basea  on  a  blend  of  historical  data  and 
management  judgement.   [Ref.  1:  p.  7] 


B.   ASSUHPTIONS 

Assunptions  regarding  flows  of  personnel  within  a  system 
can  be  classified  in  various  ways.  If  a  manpower  policy 
dictates  that  a  specific  percentage  of  individuals  in  a 
particular  category  would  leave  in  a  given  time  period  then 
the  assamption  about  the  flow  would  be  deterministic.  Given 
a  level  or  inventory  in  the  category  under  the  deterministic 
assumption,  then  there  is  no  uncertainty  about  how  many 
individuals  will  actually  leave.  A  deterministic  model 
contains  no  random  variables  and  there  is  a  unique  set  of 
model  output  data  for  a  given  set  of  inputs.   [  Eef .  3:  p.  3] 

However,  if  each  individual  in  a  category  had  a  certain 
probability  of  leaving  in  a  given  time  period,  then  the 
assumption  about  the  flow  would  be  stochastic.  Given  a 
level  of  inventory  in  tne  category  under  the  stochastic 
assumption,   then  a  prediction  of   the  number  of  individuals 
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leaving  would  not  be  precise.  A  stochastic  model  contains 
one  or  more  random  variables.  The  output  of  a  stochastic 
model  is  random  and  thus  only  estimates  the  true  character- 
istics of  the  system  [fief-  3:  p.  3].  The  output  of  a 
stochastic  luodel  is  a  probability  distribution.  If  individ- 
uals Kithin  a  system  behave  independently  regarding  their 
decision  of  leaving,  the  actual  number  of  leavers  has  a 
binomial  distribution  wnose  average  is  a  certain  percentage 
of  the  inventory  within  a  category  [fief.  1:  p.  7]. 
Voluntary  attrition  from  the  Marine  Corps  is  a  stochastic 
flow  since  it  results  from  a  number  of  more  or  less  indepen- 
dent individual  decisions, 

C.   AITEITION  BATES 

A  fundamental  role  in  manpower  analysis  is  played  by 
attrition  rates.  The  time  period  can  be  any  duration.  The 
most  ccmmonly  used  periods  are  months  and  years.  The  use  of 
tne  central  attrition  rate  when  the  categories  are  defined 
with  respect  to  length  of  service  can  be  considered  as  esti- 
mating the  constant  rate  of  leaving  or  separation  during  a 
year.  Using  this  method,  the  standard  error  of  the  central 
attrition  rate  can  be  estimated  if  some  assumptions  are  made 
about  the  attrition  process: 

1  .  The  number  in  the  category  throughout  the  year  was 
constant  and  equal  to  the  average  number  in  tnis 
class  during  the  year. 

2.  Let,  m  =  the  central  attrition  rate. 

3.  Let,  S*  =  Average  number  in  a  category  during  the 
year. 

4.  Let,  L*  =  Number  of  leavers  during  tae  year  who  were 
in  this  category  wnen  they  left. 

5.  Then,  m  =  L*  /  S*. 
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6.   Each  individual  in  the   category  is  subject  to   the 

probability   (mdx)    of   leaving   in  each   interval 

(jc,x+dx).   [Hef-  1:  p.  25] 

Under   these      assumptions   the   number      of    losses      would   be 

distributed  Poisson     (S*in)  .       The   estimated    standard    deviation 

ox    the   central  attrition   rate    would  be:      [Ref-     1:    p.    25] 

sd (m)     =   m    /  sqrt  (L*) 

D.   HAEIHE  COHPS  OFFICER  CENTRAL  ATTRITION  RATES 

Seme  empirical  attrition  rates  are  provided  for  the 
purpose  of  familiarization.  First  are  macro  (large  aggre- 
gate) examples  and  proceed  progressively  to  more  refined 
categorized  examples.  An  example  of  the  estimated  central 
attrition  rates  and  their  estimated  standard  deviations  are 
provided  in  Table  VII.  The  average  inventory  over  the  seven 
year  period  is  also  provided  in  Table  VII  in  order  to 
illustrate  the  number  of  officers  within  each  MOS. 

Ihe  rates  in  Table  VII  are  an  aggregation  of  all  grades 
within  a  Military  Occupational  Specialty  (MOS)  and  ail  loss 
types.  Tne  time  period  includes  all  seven  years  cf  inven- 
tory counts  and  the  losses  for  fiscal  years  1977  to  1982. 
Ihe  central  attrition  rate  was  calculated  as  follows: 

1.  Let,  t  =  1...6,  covering  years  1977. ..1982. 

2.  Let,  Y  (t)  =  Number  of  losses  in  year  t. 

3.  Let,  INV(t)  =  Inventory  in  the  beginning  of  year  t. 

4.  Let,  N  (t)  =  Average  inventory  in  year  t,  using  the 
Beginning  inventory  in  year  t  and  t+1  and  computing 
their  average:   (INV  (t)  ♦INV  (t+ 1)  ) /2. 

5.  Let,  m  =  Sum  of  all  Y(t)  divided  by  sum  of  ail  N(t). 
The   central  attrition   for   each  Military   Occupational 

Specialty  is  illustrated  for  the  length  of  service  period 
from  0  to  30  in  Appendix  A,  in  figures  A.I  toA.10.  These 
figures  illustrate  tne  aggregate   central  attrition  rate  for 
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TABLE 

?II 

CENTfiAL 

ATTBITIOH    BATES 

AND    STANDAfiD    DEVIATIONS 

AVERAGE 

STANDARD 

MOS 

INVENTCfiY 

BATES 

DEVIATIONS 

01 

531 

.121 

.006 

02 

288 

.119 

.008 

03 

3059 

.091 

.002 

04 

183 

.104 

.010 

05 

1322 

.  10  1 

.004 

06 

36 

.080 

.019 

07 

657 

.  114 

.005 

08 

c 

.034 

.034 

09 

6 

.027 

.027 

10 

512 

.105 

.006 

11 

102 

.090 

.012 

12 

76 

.  100 

.015 

13 

782 

.  138 

.005 

14 

128 

.099 

.01  1 

15 

94 

.098 

.013 

16 

1148 

.  139 

.004 

17 

26 

.  156 

.032 

18 

40 

.120 

.022 

19 

309 

.  109 

.008 

20 

364 

.115 

.008 

21 

251 

.  143 

.010 

22 

22 

.156 

.035 

23 

59 

.150 

.020 

24 

459 

.193 

.008 

25 

2a 

.  124 

.029 

26 

17 

.08  1 

.029 

27 

1  1 

.047 

.033                           \ 

28 

156 

.093 

.010 

29 

1  19 

.  120 

.013 

30 

262 

.100 

.008 

31 

0 

.000 

.000 

32 

65 

.042 

.017 

33 

89 

.067 

.01  1 

34 

14 

.  108 

.036 

35 

32 

.088 

.021 

36 

526 

.125 

.006 

37 

108 

.091 

.012 

38 

5644 

.08  1 

.002 

39 

1296 

.  100 

.004 

TOTAL 

18780 

.  102 

.001 

all  eight  types  of  losses  over  the  six  years  from  1977  to 
1982.  These  figures  demonstrate  the  treakpoints  and  discon- 
tinuities within  each  MOS  over  the  distribution  of  length  of 
service.   One  can  very  easily  note  the  increased  rates  wnich 
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occur  at  certain  years  of  length  of  service.  The  increased 
rates  frequently  occur  at  the  four,  20,  and  30  years  of 
length  of  service.  Ihe  central  attrition  rate  for  each  MOS 
is  illustrated  for  the  grades  from  Warrant  Officer  to 
Colonel  (labelled  0...9  on  the  abscissa)  in  Appendix  A,  in 
Figures  A. 1 1  to  A, 20.  These  figures  also  illustrate  the 
aggregate  central  attrition  rate  for  all  eight  types  of 
losses  over  the  same  si^  years. 

The  central  attrition  rate  for  each  fiscal  year  aggre- 
gated ever  all  categories  is  provided  in  Table  VIII  and 
illustrated  in  Figure  4. 1.  The  trend  is  stable  for  the 
first  three  years  and  a  decreasing  trend  is  evidenced  in  the 
last  three  years. 


TABLE  VIII 
OSMC  OFFICEB  LOSS  BATES  PER  FISCAL  YEAB 

1977    1978    1978    1980    1981    1982 
.1133   .1137   .1136   .1012   .0928   .0781 


Table  IX  is  provided  in  order  to  gain  an  appreciation  of 
the  number  of  losses  within  each  loss  type.  This  table 
includes  the  number  of  losses  for  each  fiscal  year  1977 
through  1983  aggregated  over  all  grades,  LOS,  and  MOS.  This 
table  also  identifies  the  loss  types.  One  can  note  the 
large  nuniDer  of  losses  throughout  the  seven  years  in  types 
two,  four,  and  six.  Additionally,  after  reaching  a  total 
nigu  in  1978,  the  general  trend  is  a  decreasing  number  of 
losses. 


no 
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Figure  U.I    OSflC  Officer  Loss  Bates  Per  Fiscal  Year. 

The  central  attrition  rates  tor  this  level  or  aggregation 
lor  fiscal  years  1977  tnrough  1982  are  illustrutea  in  Taule 
X. 

The  central  attrition  rates  for  each  Military 
Occupational  Specialty  (MOS)  and  for  each  liscal  year  l'::>77 
through  1982  is  providea  in  laLle  il.  rhest  rates  are 
aggregated  over  all  less  types.  Tnese  rates  are  graphically 
illustrated  in  Appendix  A,  in  Figures  A. 21  to  A. 28.  One  can 
observe  tae  change  of  rates  over  time  for  eacn  ^<}S.         It  can 
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TiBLE  IX 
LOSSES  PEE  FISCAL  YEAR  BY  LOSS  TYPE 


LOSS 

TYPE 

1977 

1978 

1979 

1  980 

1981 

1982 

1983 

TOTAL 

LI 

118 

90 

109 

40 

24 

37 

35 

453 

L2 

675 

701 

681 

594 

53  1 

524 

362 

4068 

13 

32 

U2 

38 

39 

34 

22 

27 

234 

L4 

364 

515 

53  6 

572 

532 

431 

317 

3267 

15 

1 

1 

0 

0 

1 

0 

0 

3 

L6 

8sa 

772 

750 

595 

522 

406 

436 

4375 

17 

15 

27 

17 

2 

3 

2 

10 

76 

18 

52 

50 

39 

35 

43 

30 

4  1 

290 

TOTAL   2151  21S8  2170  1877  1690  1452  1228  12766 


LOSS  TYPES 

LI:  Voluntary  Resignation 

L2:  Voluntary  Retirement 

L3;  Involuntary  -  Death 

L4:  Involuntary  Discharge 

L5:  Involuntary  Disability 

L6:  Release  trom  Active  Duty 

L7:  Disability  Retirement 

L8:  Involuntary  Retirement 


be  noted  with  interest  that  the  MOS's  witn  larger  invento- 
ries (refer  to  Table  VII  for  the  average  inventory  witnin 
each  «CS)  do  not  change  as  much  over  time  as  the  MOS's  with 
smaller  inventories.  For  example,  the  infantry  MOS  03  has 
the  secona  largest  inventory  count.  The  range  of  the  rates 
over  time  fluctuate  from  a  low  of  0.070  to  a  high  of  0.106 
giving  a  range  of  0.036.  The  logistic  MOS  04,  on  the  otner 
aand,  has  a  much  smali.er  inventory  count  and  the  rates  over 
time  fluctuate  from  a  low  of  0.049  to  a  high  of  0.22,  giving 
a  range  of  0.171.  In  ether  words,  the  total  inventory  count 
in  a  category  nas  a  definite  effect  on  the  change  of  rates 
over    time. 
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TABLE  X 
CEBTBAL  ATTEITIOH  BATE  PER  FISCAL  YEAE  BY  LOSS  TYPE 


LOSS 

TYPE 

1977 

1978 

1979 

1980 

1981 

1982 

LI 

.006 

.005 

.  006 

.002 

.001 

.002 

L2 

.03  0 

.036 

.  035 

.031 

.028 

.028 

L3 

.00  2 

.002 

.  002 

.002 

.002 

.001 

L4 

.019 

.027 

.  028 

.030 

.028 

.023 

L5 

.00  0 

.000 

.  000 

.000 

.000 

.000 

L6 

.047 

.040 

.  039 

.031 

.027 

.022 

L7 

.001 

.001 

.  001 

.000 

.000 

.000 

L8 

.00  3 

.003 

.002 

.002 

.002 

.002 

Ihe  change  in  the  rates  are  also  affected  ty  the  level 
of  disaggregation.  The  central  attrition  rates  for  the 
major  Military  Occupational  Specialty  groups  (Aviation, 
Comoat  Support,  Ground  Combat  and  total)  are  listed  in  Table 
XII.  These  major  groups  are  illustrated  in  Appendix  A,  in 
Figures  A. 29,  A. 30,  A. 31,  and  A. 32.  Tne  rates  demonstrate 
the  aggregation  over  all  types  of  losses.  For  brevity  the 
illustrations  were  limited  to  the  grades  or  Second 
Lieutenant  to  Lieutenant  Colonel. 

A  further  disaggregation  is  listed  in  Table  XIII  and 
illustrated  in  Figure  4.2.  These  rates  demonstrate  the 
aggregation  within  the  Combat  Support  group  over  all  lengths 
of  service  (LOS)  for  tne  voluntary  retirements  (type  2  loss) 
for  the  grades  of  First  Lieutenant,  Captain,  Major  and 
Lieutenant  Colonel.  A  common  observation  within  this  loss 
type  is  the  increased  rate  as  tne  grade  increases.  This 
pheacnenon  also  corresponds  to  the  increased  rate  of 
attrition  in  the  larger  LOS  categories. 

Another  level  of  disaggregation  of  the  Combat  Support 
group  is  listed  in  Table  XIV  and  illustrated  in  Figure  4.3. 
These   rates  demonstrate   the  aggregation   within  the   Motor 
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TABLE    XI 

CENTBAL    ATTEITION    RATES    FOR 

FISCAL 

YEARS 

BY    MOS 

C1CS 

1977 

1978 

1979 

1980 

1981 

1982 

1 

.  13  2 

.  137 

.  134 

.  132 

.  105 

.092 

2 

.  136 

.  151 

.  11  1 

.  112 

.120 

.093 

3 

.  106 

.092 

.098 

.08  8 

.086 

.070 

4 

.049 

.220 

.079 

.097 

.128 

.085 

5 

.  110 

.  103 

.  120 

.  118 

.079 

.073 

6 

.057 

.  121 

.  13  1 

.029 

.133 

.024 

7 

.  128 

.  119 

.  113 

.  109 

.  124 

.087 

8 

.286 

.000 

.000 

.000 

.000 

.000 

9 

.000 

.000 

.200 

.000 

.0  00 

.000 

10 

.  135 

.097 

.  150 

.09^ 

.091 

.058 

11 

.  109 

.078 

.110 

.  103 

.078 

.065 

12 

.068 

.  12t> 

.  113 

.  132 

.079 

,079 

13 

.  131 

.  152 

.  153 

.  145 

.132 

.  112 

14 

.  128 

.  158 

.  134 

.079 

.063 

.076 

15 

.  127 

.  167 

.125 

.  115 

.049 

-043 

16 

.  162 

.  139 

.  160 

.  142 

.111 

.  113 

17 

.172 

.  151 

.208 

.  120 

.077 

.213 

18 

.  123 

.  173 

.  100 

.099 

.  146 

.076 

19 

.  164 

.  113 

.096 

.  112 

.084 

.107 

20 

.  105 

.  137 

.  108 

.  148 

.  116 

.081 

21 

.  123 

.  178 

.  140 

.170 

.130 

.  118 

22 

.  192 

.078 

.250 

.270 

.063 

.056 

23 

.211 

.  192 

.087 

.  164 

.  113 

.171 

24 

.24  4 

.209 

.  195 

.  192 

.  154 

.148 

25 

.  133 

.  130 

.078 

.  145 

.  122 

.133 

26 

.06  5 

.067 

.  194 

.059 

.056 

.056 

27 

.000 

.000 

.000 

.000 

.000 

.091 

28 

.093 

.  102 

.062 

.115 

.  104 

.083 

29 

.  106 

.  165 

.  137 

.  140 

.113 

.049 

30 

.086 

.  119 

.083 

.  101 

.  120 

.089 

31 

.000 

.000 

.000 

.000 

.000 

.000 

32 

.000 

.000 

.000 

.000 

.065 

.031 

33 

.072 

.047 

.044 

.  086 

.043 

.  110 

34 

.  146 

.000 

.20  7 

.074 

.074 

,  148 

35 

.  151 

.066 

.030 

.  114 

.029 

.154 

36 

.  159 

.  139 

.  139 

.098 

.111 

.094 

37 

.  165 

.  113 

.045 

.  103 

.051 

.076 

38 

.086 

.  102 

.093 

.07  5 

.073 

.056 

39 

.090 

.093 

.  123 

.089 

.1  1U 

.0  94 

Transportation  MOS  ever  all  LCS  for  tne  voluntary  retire- 
ments (type  2  loss)  for  the  grades  of  First  Lieutenant, 
Captain,  Major,  and  Lieutenant  Colonel.  This  illustrates 
the  irregular  distritution  resulting  from  voluntary  attri- 
tion. Ihe  number  of  losses,  even  with  aggregation  ever  LOS, 
13  relatively  small.  The  actual  losses  in  tnis  tatle  range 
from  2ero  to  seven. 
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TABLE    XII 
CENIBAL    AITfilTION    BATES 

AVIATION     (MOS    38) 
GRADE       1977       1978       1979       1980       1981       1982 


2LT 

.03  0 

.020 

.031 

.026 

.006 

.017 

1LT 

.05  5 

.082 

.032 

.025 

.029 

.018 

CAPT 

.148 

.  173 

.201 

.157 

.  163 

.096 

MAJ 

.068 

.077 

.  06  9 

.049 

.047 

.045 

LICOL 

.  125 

.  171 

.  144 

.  145 

.108 

.145 

COMBAT    SUPPORT    GROUP     (MOS    07,13,20) 
GRADE       1977      1978       1979      1980       1981       1982 
2LT  .003      .025      .023      .009       .010       .015 


1LT 

.249 

.240 

.232 

.  187 

.197 

.142 

CAPT 

.082 

.082 

.  104 

.125 

.  106 

.080 

MAJ 

.123 

.  119 

.  107 

.145 

.125 

.080 

ITCOL 

.136 

.  193 

.  114 

.155 

.154 

.  114 

GROUND    COMBAT    GROUP     (MOS    03,05,10) 
GRADE       1977       1978       1979       1980       1981       1982 


2LT 

.010 

.016 

.  Oil 

.008 

.013 

.011 

1LT 

.23  1 

.190 

.220 

.151 

.137 

.121 

CAPT 

.08  1 

.059 

.  074 

.093 

.090 

.065 

MAJ 

.07  4 

.051 

.060 

.060 

.059 

.032 

LICOL 

.112 

.  129 

.  131 

.  139 

.100 

.  109 

TOTAL    MOS 
GnADE       1977       1978       1979      1980       1981       1982 


2LT 

.035 

.032 

.038 

.025 

.020 

.026 

1LT 

.17  2 

.161 

.  146 

.1  18 

.111 

.093 

CAPT 

.129 

.  128 

.  140 

.  133 

.  123 

.088 

MAJ 

.099 

.095 

.  099 

.034 

.075 

.056 

LTCOL 

.  119 

.  154 

.  155 

.  146 

.  110 

.  130 

A    further    disaggregation   tc   include   each  LOS    (not   illus- 
trated),   would  demonstrate   the   effect    of    a   smaller    number  of 
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TABLE    XIII 

)0P      VOL.    H] 
CENTBAL    ATTRITION    BAT] 

GEADE       1977       1978       1979       1980       1981        1982 


COHBAT    SOPPOBT    GfiOOP      VOL.    BET.     (LOSS    TYPE    2) 

---is 


ILT 

.00  4 

.005 

.  000 

.004 

.002 

.004 

CAPT 

.031 

.017 

.  023 

.017 

.021 

.010 

MAJ 

.  103 

.  105 

.  092 

.1  13 

.090 

.068 

LTCOL 

.122 

.  178 

.  114 

.147 

.145 

.  114 

elements  within  a  cell.  This  is  particularly  evident  when 
disaggregation  is  within  a  sparsely  populated  MOS  and  the 
desired  rate  is  at  a  particular  grade,  LOS,  and  loss  type 
combination. 
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TABLE    XI7 

;    20)       VOL. 
CEBlHAL    ATTRITION    HATES 

GhkDE       1977       1978       1979       1980        1981        1982 


flOTOfi    TfiANSPORTJMOS    20)     _?0L.    RET.     (LOSS    TYPE    2) 


1LT 

.00  0 

.023 

.000 

.0  11 

.000 

.000 

CAPT 

.164 

.049 

.  043 

.047 

.022 

.021 

MAJ 

.000 

.298 

.  16  9 

.2j5 

.041 

.107 

LTCOL 

.000 

.857 

.  286 

.222 

.000 

.000 
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V.  JAHES-STEIH  TJCHNIfiOES  iO  fALIDATIQl!  PEQCEDDBES 

A.   BACKGROOND 

The  first  basic  process  in  statistics  is  the  simple  act 
of  counting  and  the  second  most  basic  process  is  averaging. 
One  use  of  observed  averages  is  to  predict  or  estimate  unob- 
served quantities.  For  example,  a  baseball  player  who  gets 
eignt  hits  in  25  times  at  bat  is  said  to  have  a  .320  batting 
average,  which  is  the  ratio  of  8  to  25.  In  computing  this 
statistic  ai.  estimate  is  formed  of  the  player's  true  batting 
aoility  in  terms  of  his  observed  average  rate  of  success  at 
the  plate.  A  common  reply  to  the  inquiry  regarding  tnis 
baseball  player's  next  100  times  to  bat  would  be  that  he 
would  probably  get  32  more  hits.  In  traditional  statistical 
theory,  no  other  estimation  rule  is  uniformly  better  than 
the  observed  average.  The  attrition  in  cells  is  modeled  as 
a  set  of  Bernoulli  trials  whose  numoer  is  the  inventory  of 
that  cell.  The  process  of  choosing  leavers  is  viewed  as 
independent  from  individual  to  individual.  A  convient 
anology  might  be  that  of  a  baseball  players  batting  average 
in  which  each  time  at  bat  is  viewed  as  an  independent 
Bernoulli  trial. 

Osing  tiie  above  Bernoulli  (Binomial)  model,  the  rates 
(or  batting  averages)  are  the  Maximum  Likelihood  estimators 
of  the  Bernoulli  parameter.  The  maximum  liKelihood  proce- 
dure examines  the  likelihood  function  of  the  sample  values 
and  takes  as  the  estimates  of  tne  unknown  parameters  those 
values  that  maximize  the  likelinood  function  [Bef.  4:  p. 
363]-  These  estimates  are  optimal  (m  particular  admis- 
sible) if  we  are  talking  about  one  cell  (or  one  ball 
player)  .  But  it  is  no  longer  true  if  we  are  dealing  witn 
three  or  more  cells  (cr  ball  players). 


50 


Charles  Stein  [fief.  5]  showed  in  1955  that  it  is 
possible  to  make  a  uniform  improvement  on  the  iiaxiaiam  like- 
lihood estimator  (MLE)  in  terms  of  total  squared  error  risk 
when  estimating  several  means  from  independent  normal  popu- 
lations. In  19t>1,  James  and  Stein  [Ref.  6]  presented  to  the 
Berkeley  Symposium  an  estimator  which  shrinks  the  maximum 
likelihood  estimator  toward  the  origin,  and  can  improve 
(i.e.  lower  risk)  on  the  maximum  likelihood  estimator  quite 
substantially  provided  there  are  three  or  more  cell.  To 
continue  the  baseball  example,  if  one  considers  three  or 
more  baseball  players  and  desires  to  predict  future  batting 
averages  for  each  of  them,  then  there  is  a  procedure  that  is 
better  than  simply  extrapolating  from  the  separate  averages. 
Efron  and  Morris  [Ref.  7]  demonstrated  an  application  of  the 
Stein  rule  and  its  generalizations  applied  to  predict  base- 
ball averages,  to  estimate  toxomosis  prevalence  rates,  and 
to  estimate  the  exact  size  of  Pearson's  chi-square  test  with 
results    from      a   computer   simulation.  The   result      of    tiiese 

applications  was  a  mean  square  error  of  the  Stein  rule  being 
less  than  naif  of  that  when  one  uses  the  individual 
empirical   averages. 

The  first  step  in  applying  Stein's  method  is  to  deter- 
mine the  average  of  the  averages.  This  grand  average  or 
grand  mean  must  lie  between  0  and  1.  The  essential  process 
in  Stein's  method  is  the  "shrinking"  of  all  the  indiviaual 
averages  toward  this  grand  mean,  wnen  the  Stein's  metnod  is 
applied    to   attrition    rates,    tne   results    are    as    follows. 

1.  If   an    aggregated      rate   is   nigher    than     the   grand    mean 
then   it   will    be   reduced. 

2.  If  an      aggregated   rate    is      lower    than  the      grand   mean 
then    it    will    be   increased. 

Figure  5. 1  is  a  simple  illustration  of  this  "shrinking" 
concept  utilizing  fictitious  values  for  the  empirical 
values.  In  this  simple  illustration  the  effective  shrinkage 
is    2  5    percent. 
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B.       JABES-STEIN    ESTIMATOR 

Ihe  James-Stein  estimator  for  eacn  desired  cell  is  found 
through    the  following   equation. 

1.  Let    Y*    =   the    grand    mean. 

2.  Let    m    =   sample    attrition    rate    for    each   cell. 

3.  Then,  p  =  Y*  +  c(m-y*) ,  for  a  specialized  value  of 
"c" ,    is   the  James-Stein    estimator. 

See  Appendix  B,  for  the  James-Stein  estimation  algo- 
rithm. See  Appendix  C,  for  more  mathematical  details  on  the 
equation   used      for   the      James-Stein   estimator.  The   amount 

(m-Y*)  is  the  amount  by  which  the  sample  attrition  rate 
differs  from  tne  grand  mean  of  the  desired  cells.  The  equa- 
tion thus  states  that  the  James-Stein  estimator  "p"  differs 
from  tne  grand  mean  by  this  same  quantity  (m-Y*)  multiplied 
by      a      coefficient,         "c".  The      coefficient      "c"      is      the 

"shrinking  factor"  used  in  the  James-Stein  estimation 
process.  If      "c"    was      equal      to      1,      then      the      resultant 

James-Stein  estimator  for  a  given  cell  would  be  identical  to 
the    cell's      sample  attrition   rate.  In    other      words,      p=m. 

Stein's  theorem  [Ref.  6]  states  that  the  shrinking  factor  is 
always  less  than  1.  Its  actual  value  is  determined  by  tne 
collection    of    all   observed   attrition   rates. 

As  an  example,  if  the  shrinking  factor  "c"  was  equal  to 
.3,  then  each  attrition  rate  will  shrink  about  70  percent  of 
the    distance    to    the    grand    mean.      Three    examples    follow: 

1.  Assume  the  sample  attrition  rate  was  .10,  the  grand 
mean  was  .05,  and  "c"  equal  to  .3.  Then 
p=. 05+.3 {. 10- .05) .  Tne  result  is  .065.  By  Stein's 
theorem,  the  attrition  rate  is  best  estimated  by  .065 
rather  than  the  historical  sample  attrition  rate. 
Shrinkage    was    down    toward   the    grand    mean. 

2.  Assume  the  sample  attrition  rate  was  .50,  the  grand 
mean      was        .05,         and      "c"        equal      to         .3.  Then 
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p=.05+.3 (.50-,05) .         The   result    is       .185.         The   esti- 
mator     is    smaller      than      the      sample   attrition      rate. 
Shrinkage    was    dcwn    toward  the   grand   mean. 
3.       Assume   tne    sample   attrition    rate      was    .05,      the    grand 
mean      was         .10,         and      "c"        equal      to         .7.  Then 

p=. 10  +  .7  (.05- .  10) .  The  result  is  .065,  Shrinkage 
was  up  toward  the  grand  mean. 
There  are  several  expressions  for  the  James-Stein  esti- 
mator, tut  all  of  them  have  in  common  a  shrinking  factor 
"c".  without  exhibiting  any  of  the  various  formulas  for 
"c",  the  following  is  a  description  of  their  general 
behavior.  See  TaDle  XV  for  the  James-Stein  formula  utilized 
in  this  project.  See  Appendix  C  for  further  background  and 
formulas. 

Let  "K"  be  the  number  of  unknown  means;  let  sigma  be  the 
within  population  standard  deviation;  and  let  SSB  be  tne  sum 
of  square  of  cell  means  measured  from  the  grand  mean- 
Holdicg  sigma  and  "K"  constant  in  the  formula  for  "c",  the 
shrinking  factor  is  affected  by  SS3.  Tne  shrinking  factor 
"c"  becomes  smaller  as  SSB  gets  smaller.  The  predicted 
means  are  more  affected  cy  this  condition  and  the  resultant 
Clean  is  closer  to  the  grand  mean.  On  tne  other  hand,  as  SSB 
increases,  "c"  increases  and  the  shrinkage  is  less  drastic. 
In  effect  the  James-Stein  procedure  makes  a  preliminary 
estimate  that  all  the  unobservable  means  are  near  the  grand 
mean.  If  tne  data  supports  the  condition  wherein  the  sample 
means  are  not  too  far  from  tne  grand  mean,  then  the  esti- 
mates are  all  snrunk  further  toward  tne  grand  mean.  If  this 
condition  is  contradicted,  then  the  shrinking  is  minimal. 
The  distribution  of  the  sample  means  around  the  grand  mean 
effects  tne  shrinking  factor.  The  numcer  of  means  being 
estimated  also  influences  the  shrinking  factor.  If  "K"  is 
rather  large,  the  shrinking  to  the  grand  mean  maybe  more 
drastic.      [Ref.    8:    p.    123] 
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Which  set  of  values,  "m"  or  "p",  is  the  better  indicator 
of  the  attrition  rate?  In  order  to  answer  that  question  one 
would  have  to  know  the  true  attrition  rate.  This  true 
average  shall  be  designated  with  "TH"  in  the  main  text  of 
this  paper  ana  by  tne  Greek  letter  theta  in  the  Appendices. 
It  IS  the  probability  that  an  officer  would  leave  any  time 
and  is  actually  an  unknowable  quantity.  Although  "TH"  is 
unoDservable,  an  approximation  can  be  made  utilizing 
historical  data. 

One  method  of  evaluating  the  two  estimates,  i.e.,  the 
James-Stein  estimator  and  the  observed  attrition  rate  m,  is 
by  simply  counting  their  closeness  to  "TH".  Consider  the 
closer  the  estimator  is  to  "TH"  as  a  success  and  the  farther 
from  "TH"  as  a  failure.  The  number  of  successes  of  one 
estimator  could  then  te  compared  to  the  numoer  of  successes 
of  the  ether  estimator. 

A  more  quantitative  method  of  comparing  the  two  esti- 
mates is  through  the  total  squared  error  estimation.  This 
is  measured  by  first  determining  the  actual  error  of  each 
prediction  ,  given  by  (TH-m)  and  (TH-p)  for  each  cell.  Each 
of  these  quantities  is  then  squared  and  the  squared  values 
are  added  up.  The  observed  averages  "m"  would  have  a  total 
squared  error  denoted  by  Em,  whereas  the  squared  error  of 
the  James-Stein  estimators  would  be  denoted  by  £js. 
Comparing  Em  and  Ejs  by  the  ratio  R=Em/Ejs,  then  the 
James-Stein  method  is  "R"  times  as  accurate  as  the  observed 
attrition  rate  "m". 

Employing  the  ideas  borrowed  from  statistical  decision 
theory,  the  estimators  can  be  compared  through  a  risk  func- 
tion. The  risk  function  of  a  decision  rule  is  the  expected 
loss  (over  the  sample)  incurred  using  the  decision  rule 
[Ref.  9:  p.  8].  The  above  techniques  are  similar  to  the 
squared-error  loss  method  [Bef.  9:  p.  9]. 
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Therefore,  the  risk  function  is  the  expected  value  of 
the  squared  error  for  every  possible  value  of  "TH".  Stein's 
theorem  is  concerned  with  estimation  of  several  unknown 
means  [ Bef .  7].  No  relation  between  the  means  need  be 
assumed  and  indeed  are  assumed  to  be  independent  of  one 
another.  The  historical  development  of  statistical  theory 
from  Gauss  through  decision  theory  argues  that  the  average 
is  an  admissible  estimator  as  long  as  there  is  just  one  or 
two  means  to  be  estimated.  An  estimator  for  a  parameter  is 
admissible  if,  according  to  a  specified  criterion,  tnere 
exists  no  other  estimator  that  is  better  than  it  for  all 
possitle  values  of  the  parameter.  Stein  in  collaboration 
with  James  showed  that  when  the  number  of  means  exceeds  two 
that  estimating  each  cf  the  means  by  their  own  average  is  an 
inadmissible  procedure.  No  matter  what  the  values  of  true 
mean,  there  are  estimation  rules  with  smaller  total  risk 
[Ref.    6]. 

The  risk  function  for  the  James-Stein  estimator  is  less 
than  the  risk  function  for  the  sample  means  irrespective 
what  the  true  values  of  the  means  "TH"  happen  to  te.  The 
reduction  of  risk  can  be  substantial,  particularly  when  the 
number      of    means      is      relatively   large.  Tne      risk    of      the 

James-Stein  estimator  is  smallest  wnen  all  the  true  means 
"TH"  are  equal.  As  the  true  means  increase  in  variation 
from  one  another  the  risk  of  the  estimator  increases, 
approacning  the  value  of  the  observed  averages  tut  never 
quite  equaling  it.  The  James-Stein  estimator  does  suDStan- 
tialiy  tetter  than  the  averages  only  if  the  true  means  lie 
near    each   other.  The  James-Stein   estimator   does      at    least 

marginally  better  no  matter  what  the  true  means  are  [Ref.  8: 
p.     12^]. 
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C.       CTHEE    JAHES-STEIM    HODELS 

The  model  for  the  James-Stein  estimator  used  thas  far 
shrinks  the  observed  averages  to  tne  grand  mean.  This  is 
not  the  only  possible  procedure.  Other  models  for  the  esti- 
mator dispense  with  the  grand  mean  entirely.  The  observed 
attrition  rates  do  not  depend  on  a  choice  of  origin.  Before 
Stein  discovered  his  method  it  was  generally  accepted  that 
such  "invariant"  estimators  must  be  preferable  to  tnose 
whose  predictions  change  with  each  choice  of  an  origin.  If 
the  origin  or  zero  is  chosen  as  the  grand  mean  then  the 
terms  containing  the  grand  mean  would  be  removed  from  the 
equation.  In  this  case  the  James-Stein  estimator  would  be 
p=c(m).  The  estimation  process  is  now  complicated  by  the 
fact  that  the  shrinking  factor  "c"  would  be  different  for 
each  cell.  The  shrinking  factor  is  dependent  on  the  stan- 
dard deviation  of  the  sample  attrition  rates.  A  large  stan- 
dard deviation  implies  a  high  degree  of  randomness  or 
uncertainty  in  the  sample  data.  If  the  sample  attrition  is 
large,  it  can  be  attributed  to  random  fluctuations  rather 
than  to  an  actual  large  value  of  the  true  mean  "TH".  Thus, 
application  of  a  small  shrinking  factor  would  reduce  the 
value.      [Hef.    7:     pp.     123-U] 

There  is  one  purpose  for  which  the  measured  sample 
attrition  rate  may  well  be  superior  to  the  Jaies-Stein  esti- 
mator: wnen  a  single  cell  is  considered  in  isolation.  The 
James-Stein  method  gives  better  estimates  for  a  majority  of 
cells  and  it  reduces  the  total  error  of  estimation  for  the 
sum  of  all  the  cells.  Estimating  the  true  mean  for  an 
isolated  cell  by  Stein's  method  creates  serious  errors  when 
that  mean  has  an  atypical  value.  The  reduction  of  the  risk 
is  more  predominant  in  a  nomogeneous  setting.  Tne  inclusion 
of  an  atypical  or  ncnhomogeneous  mean  in  the  estimation 
process    would    increase    the    risk. 
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1 •      Bayesian    versus  James-Stein 

The  formula  for  the  James-Stem  is  similar  to  the 
Bayesian  equation  of  Z=m+C{y-ni).  Here  "y"  is  the  sample 
mean,         "m"   is      the    mean      of   the      prior   distribution.  The 

shrinking  factor  "C"  is  different  in  that  it  depends  on  the 
standard  deviation  of  the  prior  distribution.  [Bef.  9:  pp. 
99-110] 

ine  James-Stein  procedure,  however,  has  one  impor- 
tant advantage  over  the  pure  Bayesian  method.  The 
James-Stein  estimator  can  be  employed  without  knowledge  of 
the  pricr  distribution.  Tne  James-Stein  estimator  can  be 
regarded  as  an  empirical  Hayes  rule.  The  empirical  Bayes 
approach  uses  historical  data  to  estimate  the  prior 
distribution  [fief.    9:    p.     117]. 

2 .      Modifications    to    th e   James-Stein    Process 

The  James-Stein  estimation  process  can  be  modified 
using  the  Efron-Morris  limited  translation  version  of  the 
estimator  [Ref.    10]-  This   modification      ensures    that      the 

estimator  of  "TH"  is  not  shifted  so  far  xrom  the  sample  mean 
tnat  the  estimator  is  inconsistent  with  the  sample  mean. 
The  ttodif ication  is  a  compromise  cetween  the  James-Stein 
estimator  and  tne  Maximum  Likelihood  Estimatcr  which  has 
good  individual  properties.  The  compromise  would  follow  the 
James-Stein  rule  as  closely  as  possible  subject  to  a  fixed 
constraint  en  how  far  the  estimator  is  allowed  to  deviate 
from  the  MLE.  Such  "limited  translation  estimators"  were 
discussed  in  a  Bayesian  framework  in  tne  Efrcn-Morris 
article  [Eef.  11].  Tney  showed  that  it  is  possible  to 
consideraDly  reduce  the  maximum  possible  risk  for  any  compo- 
nent wnile  reducing  the  ensemble  savings  of  the  James-Stein 
estimator   by    as   little   as   5    or    10    percent  [Ref.     10]. 


5S 


D.   VALIDATION  TESTS 

A  validation  test  was  conducted  to  evaluate  the  effi- 
ciency of  tiie  James-Stein  shrinkage  estimator.  The  test  was 
conducted  as  follows: 

1.  Select  a  grade  within  an  Occupational  Group  to 
examine.  The  resultant  desired  data  array  will  be 
three  dimensional  (years,  LOS,  M03) . 

2.  Let  "i"  stand  for  LOS,  then  i=0, ,30. 

3.  Let  "j"  stand  for  MOS,  (tnen  values  of  j's  depend  on 
whicn  MOS  group  is  being  analized) . 

4.  Let  Dij  =  Incidence  Matrix  of  nonstructural  zeroes, 
which  is  the  same  for  all  years.  Dij  =  1,  if  cell  is 
member  of  the  feasible  region;  Dij  =  0,  otherwise. 

5.  Let  K  =  number  of  feasible  ceils,  i.e.,  sum  of  all 
Dij. 

6.  Let  Yij  =  number  of  leavers  in  cell  (i,j). 

7.  Let  t  =  1,...,I;  where  T  =  number  of  years  of  data 
used  to  create  tne  estimator. 

8.  Let  Nij  =  Inventory  in  cell  U/j)  = 
Max  ((N  (t)  ♦N(t  +  1))/2,  Y  (t))  . 

The  validation  procedures  used  t=1...4  to  compute  the  empir- 
ical estimates  and  used  t=5,6  for  validation  purposes. 
Three  estimation  methods  were  employed:  James-Stem, 
Minimax,  am  Maximum  LiKelihood. 

1  •   Preliminary  Steps : 

The  following  steps  were  utilized  to  prepare  the 
data  for  the  validation  procedures.  These  steps  included 
the  transformation  of  the  given  data  of  leavers  and  computed 
inventory  according  to  the  variance  stabilizing  arc  sine 
transformation  listed  in  Table  XV.  See  Appendix  D,  Figure 
D.18  for  the  APL  listings  of  the  following  functions. 
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•  let  ISij  (t)  =  Yij(t)  BINPBEP  Nij(t),  where  "BINPREP" 
prepares  the  Freeman-Tukey  version  of  the  arc  sine 
transformation    for   binomial   data. 

•  let  T   =    4,    years  of    data    desired. 

•  Let  Jij  =  Dij  JAMES  ISij(t),  where  "JAMES"  retarns  a 
James-Stein  estimator  for  the  means  of  the  cells  in  the 
last  two  dimensions  of  "IS"  while  being  screened  by  the 
incidence   matrix   "D"   of   nonstructural   zeroes. 

•  let  Pij  =  Jij  BINCONV  Nij(t),  where  "BINCONV"  inverts 
the  arc  sine  transformation  used  in  preliminary  step 
cne. 

2  .      Validation   Procedures : 

Ihe  following      procedures    were   utilized      to    validate 
the    effectiveness   of    the    the   James-Stein    estimation    process. 

•  let  IS*ij  =  ISij(I+l),  wnich  is  assumed  to  be 
distributed    Nor mal (Jij , unknown  variance). 

•  let  NOR  =  IS*ij  -  Jij,  where  Jij  was  derived  in 
preliminary    step    three. 

•  Cse  the  distribution  fitting  capaoility  of  GEAFSTAT  or 
any  other  comparable  graphical  display  package,  to 
compare  NOR  to  the  Normal  distribution.  Desired  output 
is  d  comparison  nistogram  of  the  data  to  a  fitted 
Normal  density,  compare  sample  distribution  to  fitted 
Normal  distribution,  normal  probability  g-Q  plot  with 
data,  and  possibly  the  survivability  curve  fit.  These 
plots  can   all   be    plotted    on   one  sheet   of  output. 

•  Compute  test  statistics  comparing  data  fitted  to  a 
normal  distribution. 

•  Compute  the  figure  of  merit  from  the  test  statistics 
using  the  sum  of  the  square  of  the  sample  mean  and  the 
square   of   the  sanple   standard    deviation. 
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3 •      flinimax    Estimator 

The  following  procedures  were  utilized  tc  compare 
the  James-Stein  estimator  to  the  :iinimax  estimator.  See 
Table  XV  for  the  formula  for  the  Minimax  estimator.  An 
simple  example  of  the  possible  values  resulting  from  the  use 
of  this  formula  is  provided  in  Table  XVI,  to  illustrate  the 
shrinkage  characteristics  inherent  in  its  use  for  the 
purpose  of  projecting  rates  whicn  have  extreme  empirical 
values. 


TABLE    XVI 
HININAX    EXAHPLE    VALOES 


MINIMAX 

LEAVERS 

INVENTORY 

ESTIMATE 

0 

0 

.50 

0 

1 

.25 

1 

1 

.  75 

0 

a 

.  17 

1 

4 

.  33 

See    Appendix      D,       Figure    D. 19    for      tne   APL    listings      of    tnis 
function. 

•  Let  PMMij  =  Yij(t)  MINMAX  Nij  (t) ,  where  "MINMAX" 
returns    the    MINMAX   estimates   for    the    binomial. 

•  Let  ISM  =  Nij  ARCSIN  PMMij,  where  "ARC3IN"  returns  the 
inverse  sine  transformation  for  use  when  the  success 
probability  PMMij  is  estimated  directly  (e.g.,  by 
ftlNMAX)  . 

•  Let  NOfvMM  =  IS*  -  ISM,  where  IS*  is  from  validation 
procedures    step    cne. 
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•  Use  the  distribution  fitting  capability  of  GRAFSTAI  or 
any  other  comparable  graphical  display  package,  to 
compare  NORMM  to  the  Normal  distribution  similar  to  the 
validation    procedure   three, 

•  Compute  test  statistics  comparing  data  fitted  to  a 
Normal   distribution. 

•  Compute  the  figure  of  merit  from  the  test  statistics 
using  the  sum  of  the  square  of  the  sample  mean  and  the 
square   of    the  sample    standard    deviation. 

^ •      Maximum   Like lihood 

The   natural   empirical    estimator    without    shrinkage  is 
the      Maximum      Likelihood      estimator-  Comparison      of      the 

James-Stein  process  to  the  Maximum  Likelihood  process  will 
illustrate  the  efficiency  of  tne  James-Stein  estimation 
process. 

•  let   Y    =   sum    of    Yij(t)     for    t=1    to   T.       Where    T   =    U. 

•  let    N    =    sum    of    Nij(t)     for    t=1    to    T.       Where    T   =    4. 

•  let    ISAij    =    Y    BINPREP    N. 

•  let  NORA  =  IS*  -  ISAij,  where  IS*  is  from  the  valida- 
tion   procedures    step   one. 

•  Use  the  distribution  fitting  capaoility  of  GRiiFSTAT  or 
any  otner  comparaole  graphical  display  package,  to 
compare  NORA  to  the  Normal  distribution  similar  to  the 
validation    procedure   three. 

•  Compute  test  statistics  comparing  data  fitted  to  a 
Normal   distribution. 

•  Compute  the  figure  of  merit  from  the  test  statistics 
using  the  sum  of  tne  square  of  the  sample  mean  and  the 
square   of   the   sample   standard    deviation. 
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E.       DATA    ANALYSIS 

A      data      analysis      was      conducted      atiiizing      the      three 
comparison  mentioned    above- 

1 .  The   James-Stein   Estimator 

Four  years  of  data  (1977-1930)  was  used  to  construct 
the  James-Stein  estimator.  The  leavers  and  inventory  were 
transformed      with      the   Freeman-Tukey      transformation.  The 

projected  year's  (1981-1983)  leavers  and  inventory  were 
constructed  and  also  transformed  witn  the  Freeman-TuKey 
transformation.  The      difference    between      the      James-Stein 

estimator  and  the  projected  years  data  was  plotted  to 
compare  tne  fit  to  a  Normal  distribution.  The  figure  of 
merit  (FOM)  was  computed  from  the  inputs  taken  from  the 
statistical    table. 

FOM=   (squared  mean)  +  (variance) . 

Table  XVII  illustrates  the  results  of  the  computa- 
tion cf  the  James-Stein,  Minimax,  and  Maximum  Likelihood 
FOM's-  The  data  organization  format  of  the  table  is 
provided  for  ease  of  interpretation  of  the  table.  The  table 
demonstates  the  calculation  of  the  FOM  for  the  years  1981, 
1982,  and  1983.  Ihe  separate  cases  evaluated  are  identified 
by  rows  which  represent  the  grades  of  First  Lieutenant  and 
Lieutenant  Colonel  in  each  of  the  four  MOS  groups  utilized 
m  tnis  project. 

2 .  The  Minimax  Estimator 

The  leavers  and  inventory  of  the  four  years 
(1977-1980)  were  used  to  construct  a  minimax  estimator.  See 
Table  XV  for  the  formula  utilized  to  derive  this  estimator. 
The  result  was  compared  to  tne  fit  of  the  Normal  distribu- 
tion in  a   similar  manner  as  discussed  above.     The  FOM  was 
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computed  as  shown  abcve  and  further  illustrated  in  Table 
XVII. 

3  •      Maximum  Likelihood 

The  last  comparison  was  using  the  leavers  and  inven- 
tory of  the  four  years  (1977-1980)  and  only  using  the 
Freeman-Tukey    transformation.  The   result      is    the      Maximum 

Likelehood   estimator      in   the    transformed   space.  The  algo- 

rithm is  the  same  as  James-Stein  estimator  described  above, 
only      the        James-Stein      calculation      is        by-passed.  The 

projected  years  data  was  used  to  compare  the  resultant 
difference  with  the  fit  to  the  Normal.  The  FOM  was  computed 
as    shewn  above  and   further    illustrated   in    Table    XVII. 

U .      Figure   of   Merit 

The  figure  of  merit  is  representative  of  the  effi- 
ciency of  the  estimator.  Since  the  FOM  is  computed  in  the 
transformed  and  Normalized  scale,  the  smaller  the  FOM,  the 
Detter   the   estimate.  The    ratios    illustrated    in      columns    3 

and  5  of  Table  XVII  illustrate  (in  most  cases)  how  many 
times  larger  the  other  estimating  technique's  FOM  is  when 
compared   to      the   James-Stein   estimator    technique.  One   can 

note  that  in  one  case  the  use  of  the  James-Stein  estimator 
technique  resulted  in  an  improvement  of  as  much  as  30  times 
over  the  Maximum  Likelihood  process  and  22  times  over  the 
Minimax  process.  These  results  are  typical  of  other  cases 
executed  by   the  author,    but    not   snown    for   redundancy   sake. 

F.       ACDITIONAI   COHPAfilSOHS 

The   above      risk   ccmparisons      were   all      performed    in      the 

transformed    (Freeman-Tukey)     space.         Since    this    distorts    the 

scale    it   seems      wise    tc   perform   some      additional  comparisons 

(sum      of    squares      error)  in      the    original       (u ntransf ormed) 
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TABLE    X7II 
FIGaBE   OF    HEfilT    COHPAfilSOtiS 

♦♦♦♦♦******       DATA    ORGANIZATION    FORMAT       *********** 
*♦*       COLUMN       ♦**  *♦*       REMARK       *** 

JAMES  FOM   of    data    using    James-Stein 

MAXLIKE  FOM    of    Maximam   Likelinood 

MAXIIKE/JAMES  Ratio   of    Maxiike    divided    by    James 

MINIMAX  FOM    of    data    using    Minimax 

MINIMAX/JAMES  ttatio   of    Minimax    divided    by    James 

♦**       ROW  *«♦  ***       REMARK       ♦*♦ 

1  1ST    LT    in    Aviation 

2  1ST    LT    in   Combat    Service    Support 

3  1ST    LT    in   Combat    Support 

4  1ST    LT    in   Ground    Combat 

5  LTCOL    in   Aviation 

6  LTCOL    in   Combat    Service    Support 

7  LTCOL    in    Combat    Support 

8  LTCOL    in   Ground   Combat 

****         IHESE    ARE    THE    FIGURES    OF    MERIT    FOR    1981       **** 

MAXLIKE/  MINIMAX/ 

ROW  JAMES  MAXLIKE  JAMES  MINIMAX  JAMES 


1 

1.69 

51.47 

30.46 

37.68 

22.30 

^ 

0.71 

1.60 

2.25 

0.78 

1.  10 

3 

1.20 

3.69 

3.08 

2.10 

1.75 

^ 

1.  19 

5.47 

4.59 

3.81 

3.20 

c 

3.05 

13.64 

4.47 

8.92 

2.92 

6 

0.22 

0.  42 

1.91 

0.23 

1.05 

7 

0.31 

2.34 

7.55 

1.  16 

3.74 

8 

0.79 

6.81 

8.62 

3.  73 

4.72 

*♦*♦         THESE    ARE    THE    FIGURES    OF    MERIT    FOR    1982       *♦** 

MAXLIKE/  MINIMAX/ 

RCH  JAMES  MAXLIKE  JAMES  MINIMAX  JAMES 


1 

4.37 

47.05 

10.77 

34.29 

7.85 

^ 

0.75 

1.55 

2.07 

0.76 

1.01 

3 

1.53 

3.71 

2.  42 

2.19 

1.43 

4 

1.89 

5.84 

3.09 

4.21 

2.23 

5.80 

17.28 

2.98 

13.08 

2.2o 

6 

0.23 

0.54 

2.35 

0.33 

1.43 

7 

0.59 

2.  60 

4.  41 

1.43 

2.42 

8 

1.  13 

8.34 

7.38 

4.92 

4.35 

****  THESE  ARE    IHE    FIGURES    OF    MERIT    FOR    1983       **** 

MAXLIKE/  MINIMAX/ 

ROW  JAMES  MAXLIKE                JAMES          MINIMAX  JAMES 

1  4.64  43.16  9.28            30.85  6.63 

2  0.91  1.64  1.80               0.86  0.95 

3  1.17  3.67  3.14               2.02  1.73 

4  2.85  5.b7  1.99               4.21  1.48 

5  5.49  11.93  2.17              8.44  1.54 

6  G.2o  0.56  2.15              0.34  1.31 

7  0.t)2  2.76  4.45               1.53  2.47 

8  1.57  8.55  5.45              5.16  3.29 
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scales.         Accordingly   the   deviations   of   the   empirical   attri- 
tion   rates    in      the  validation    time   frame      from    the   estimates 
(both   Janes-Stein   and   Maximum      Likelihood)      are    computed   and 
compared.      Specifics    follow: 

The    preliminary    steps   are   as   follows: 

•  Actual  rates:  empirical  attrition  rates  are  the  central 
attrition  rates    for    the   projected    period. 

•  James-Stein  Projected  rates:  projected  attrition  rates 
utilizing  the  James-Stein  estimation  process  previously 
defined. 

•  Let,  MLE=  Maximum  Likelihood  Estimates,  the  historical 
empirical  attrition  rates  are  assumed  to  remain  the 
same   throughout    the    projected    period. 

•  Let,  SEP=  The  squared  differences  between  the 
James-Stein  projected  attrition  rates  and  the  actual 
rates. 

•  let,  SEM=  The  squared  differences  between  the  MLE  and 
the   actual   rates. 

•  Let,    SSE(MLE)=    Ihe   sum    of    squared    differences    of   SEM. 

•  Let,    SSE  (P)  =    The    sum    of    squared    differences    of    SEP. 

•  Let,  SEP<SEM  be  the  number  of  cells  for  which  SEP  was 
less  than  SEM.  (For  convenience  of  column  heading, 
inequalities   are   allowed    to   represent    numbers.) 

•  let,  SEM<SEP  be  the  number  of  cells  for  which  SEii  was 
less   than   SEP. 

•  f.OS  Groups  evaluated  were:  CS  -  Combat  Support  Group; 
GC    -  Ground    Combat 

•  Banks  evaluated  were:  LT  -  First  Lieutenant;  LTCOL  - 
Lieutenant  Colonel 

1  .      Subtest    ^ 

First  it  was  assumed  that  all  cells  in  tne  feasiDle 
region  were  eligible  fcr  forecasting  tne  projected  attrition 
rates.    The  results   of  subtest  1  are   illustrated  in  Table 
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XVIII.  The  sum  of  squared  differences  for  the  MLE  and  the 
James-Stein  projected  rates  for  the  first  and  last  groups 
are      competitive.  One      has      to      keep      in      mind      that      the 

Jame-Stein  estimation  process  will  result  in  a  value  greater 
than  zero  given  a  cell  with  an  original  value  of  zero. 
Therefore,  when  ail  tne  cells  in  the  feasible  region  are 
considered  eligible  fcr  projection,  the  empirical  values  of 
zero  result  in  a  value  closer  to  the  grand  mean.  The  value 
of  the  sums  of  the  differences  between  the  James-Stein  esti- 
mation process  and  the  actual  rates  can  be  understandably 
larger  than  the  sums  of  the  differences  between  the  ML2  and 
the  actual  rates  when  there  are  numerous  zero  cells 
originally    in    the    feasible   region. 
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1  1 

I  TABLE    I7III  I 

I  SOBIEST    1  1 


Feasible 
Group    RaLK         Cells         SEP<SEB      SEIKSEP      SS£(MLE)        SSE(?) 

CS  LT  57 

GC  LT  57 

C£  LTCOL  48 

GC  LTCOL  48 


15 

42 

1.52 

1.91 

22 

35 

0.33 

3.99 

12 

36 

1.29 

2.93 

21 

27 

2.05 

0.66 

2.   Subtest  2 

A  second  comparison  assumed  a  conditioricg  that 
considered  only  the  cells  ic  which  there  was  an  actual 
attrition  rate  greater  than  zero.  In  other  words,  what  was 
the  projected  attrition  rate  given  an  actual  loss  occured? 
This  created  a  subfeasiDle  region  whicn  was  smaller  than  the 
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whole  feasible  region.  The  results  of  subtest  2  are  illus- 
trated in  Table  XIX.  The  sum  cf  squared  differences  for  the 
James-Stein  projected  rates  in  this  subtest  are  much  lower 
than  the  MLE. 


r 

TABLE  XIX 

SOBTEST 

2 

Sub- 

Feasicle 

Group 

Rank 

Cells 

SEP<SEM 

SEM<SEP 

SSE  (MLE) 

SSE 

(P) 

C£ 

LT 

26 

15 

11 

1. 

52 

0. 

27 

GC 

LT 

29 

22 

7 

0. 

33 

0. 

10 

C£ 

LTCOL 

16 

12 

4 

1, 

29 

0. 

1(4 

GC 

LTCOL 

28 

21 

7 

2. 

05 

0. 

12 

bS 


VI.  CONC 10 SIGNS  AND  RECOMMEND ATIOMS 

A.  SOHMAfiT 

The  purpose  of  this  study  was  to  demonstrate  the  appli- 
cation of  the  James-Stein  and  other  shrinkage  type  parameter 
estimation  schemes  for  the  goal  of  generating  manpower  loss 
rates  within  the  USMC  officer  force  structure.  This  thesis 
contains  comparisons  of  performance  of  James-Stein,  Minimax, 
and  Maximum  Likelihood  estimations  of  Marine  Corps  officer 
attrition  rates  for  select  cell  aggregates.  The  very  large 
number  of  cells  within  the  USMC  officer  force  structure 
leads  tc  the  condition  that  empirical  attrition  rates  are 
unstable.  This  problem  is  compounded  by  the  fact  that  the 
cell  probabilities  are  small.  Most  rates  are  less  than  ^0% 
and  virtualxy  none  are  greater  than  20%  at  disaggregated 
levels.  Further  difficulties  are  present  because  some  of 
the  inventory  cells  are  empty  for  structural  reason  while 
otners  are  empty  by  chance.  Therefore,  the  small  inventory 
cells  draw  especial  attention.  It  has  been  illustrated 
witnin  this  project  tnat  improvement  can  be  attained  by 
application  of  the  James-Stein  and  Minimax  shrinkage  methods 
ratner  than  the  more  natural  Maximum  Likelihood  estimation 
process.  It  is  important  to  note  tnat  the  James-Stein  and 
Minimax  snrinKage  schemes  seem  to  compete  for  appropriate 
attrition   rate   generation. 

B.  CCHCiaSIOHS 

The  shrinkage  schemes  employed  herein  offer  powerful  and 
useful  methods  for  generating  attrition  rates.  This  employ- 
ment should  lead  to  much  lower  costs  that  accrue  from  errors 
in      manpower    planning.  However,         tae      particular    way      to 

exploit    them    is    yet    tc   oe   determined. 

7  0 


The  application  of  the  James-Stein  shrinkage  technique 
has  exposed  a  number  of  proiDiem  areas  which  could  be 
expanded. 

The  aggregation  problem.  Aggregation  of  cells  (cross 
classified  by  grade.  Military  Occupational  Specialty,  length 
of  service,  etc)  into  sets  whose  attrition  behavior  is  homo- 
geneous and  of  low  internal  variability  should  lead  to  the 
most  useful  overall  performance.  One  method,  not  attempted 
by  the  author,  is  to  aggregate  over  MOS  types  in  which  the 
grade  structure  is  siiilar.  For  example,  some  MOS*s  are 
restricted  to  warrant  officers  only,  warrant  officers  and 
limited  duty  officers  only,  unrestricted  regular/reserve 
officers  only,  etc.  Another  method  would  be  to  aggregate 
over  certain  length  of  service  periods.  For  example,  1-U 
years  of  service,  5-10  years  of  service,  10-20  years  of 
service,  and  21-30  years  of  service.  Thus  examining  the 
attrition  rates  in  relationship  to  breakpoints  and  longevity 
in    the   officer's    career    pattern. 

The  limited  translation  problem.  The  shrinkage  estima- 
tors translate  tne  rates  toward  the  aggregate  mean.  Such 
translations  can  appear  to  be  excessive  in  instances  of 
extreme  rates.  This  suggests  that  techniques  that  limit  the 
translation  could  be  profitably  applied.  Current  literature 
contains  limited  translation  methods  wnicn  could  be 
examined. 

The      yearly    update      problem.  The    estimated      attrition 

rates  are  based  upon  data  from  several  recent  years.  As 
each  year  produces  a  new  set  of  experiences  there  is  need 
for  a  policy  to  include  new  data  in  the  estimation  scheme 
and    phase  out    the    influence    of   the   distant    past. 

The      validation    proDlem.  The   smallness      of    the      rates 

(i.e.,  Q~2Q%)  places  the  current  project  in  a  range  wnich 
has  net  yet  been  treated  successfully  in  the  literature. 
For  this  reason  it  is  especially  important  to  apply  cross 
validation    procedures    to    score    the    efficacy    of    the    metnods. 
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Ihe  JDult inomiai  estimaticn  problem.  The  number  of 
attritions  in  a  ceil  are  placed  into  a  number  of  (currently 
eight)  disjoint  categories.  This  produces  a  number  of  small 
sample  size  multinomial  probability  estimation  problems. 
Methodologies  for  managing  such  proDlems  have  attracted 
attention  in  the  recent  literature.  Their  usefulness  could 
be  further  examined  in  regard  to  the  officer  attrition  rate 
estimaticn  process 

C.   EICOHMENDATIONS 

It  is  recommended  that  further  studies  pursue  the  prob- 
lems identified  above.  It  is  suggested  that  a  new  data  tape 
be  provided  to  the  Naval  Postgraduate  School  which  allows 
the  distinction  between  restricted  and  unrestricted  offi- 
cers. Additionally,  the  data  for  fiscal  years  1984  and  1985 
needs  to  be  provided  in  the  near  future.  In  order  to 
provide  continued  service,  a  regularly  scheduled  yearly 
updated  data  set  should  be  forwarded  to  the  Naval 
Postgraduate  Scnool  when  available. 
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kZRMUU   1 

CENTEAL  ATTRITION  BATE  GfiAPHICS 

This  appendix  contains  graphical  illustrations  of  the 
central  attrition  rate  for  USMC  officers. 

Figures  A.I  to  A. 10  contain  the  central  attrition  rates 
for  each  Military  Occupational  Specialty  distributed  over 
the  lenqtn  of  service  period  from  0  to  30.  These  figures 
illustrate  the  aggregate  central  attrition  rate  for  all 
eight  types  of  losses  over  the  six  years  from  1977  to  1982- 
These  figures  demonstrate  the  breaKpoints  within  each  MOS 
over  the  distribution  of  length  of  service. 

Figures  A. 11  to  A. 20  contain  the  central  attrition  rates 
for  each  Military  Occupational  Specialty  distributed  over 
the  grades  from  Warrant  Officer  to  Colonel  (labelled  0...9 
on  the  abscissa).  These  figures  also  illustrate  the  aggre- 
gate central  attrition  rate  rcr  all  eignt  types  of  losses 
over  the  same  six  years. 

figures  A. 21  to  A. 28  contain  the  central  attrition  rates 
for  each  Military  Occupational  Specialty  distributed  over 
the  years  from  1977  to  1982.  Each  figure  nas  five  MOS 
graphs  and  are  dis tinguishacle  by  the  type  of  line  and 
symbol  points  for  each  separate  MOS.  These  figures  illus- 
trate the  aggregate  central  attrition  rate  for  all  eight 
types  of  losses  over  the  six  years  from  1977  to  1982. 

Figure  A. 29  contains  the  central  attrition  rates  lor  the 
AviatioE  Group  for  the  grades  from  Second  Lieutenant  through 
Lieutenant  Colonel  for  tae  years  1977  to  1982.  Figure  A. 30 
contains  the  central  attrition  rates  for  the  ComLat  Support 
Group  for  the  grades  froiu  Second  Lieutenant  to  Lieutenant 
Colonel  for  tne  years  1977  to  1962.  Figure  A. 31  contains 
the  central  attrition  rates  for   the  Ground  Combat  Group  for 
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the  grades  from  Second  Lieutenant  to  Lieutenant  Colonel  for 
the  years  1977  to  1982.  Figure  6T0TMGS  contains  the  central 
attrition  rates  for  the  Total  MOS's  for  the  grades  from 
Second  Lieutenant  to  Lieutenant  Colonel  for  the  years  1977 
to  1962. 
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CENTRAL  AHRITION  RATES 
AVIATION  (MOS  38) 
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Figure    A. 29        Central   Attrition   Rates   for   the   Aviation   Group. 
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CENTRAL  ATTRITION  RATES 
GROUND  COMBAT  GROUP  (MOS  03.05.10) 
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Figure   A. 31        Central  Attrition   Rates 
for    the  Ground   Conbat   Group. 
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APPENDIX    B 
JAHES-STEIN    ESTIHATOB    ALGORITHH 

Ibis  appendix  contains  the  detailed  algorithm  for  the 
James-Stein  estimator  of  attrition  rates  for  the  USMC 
officer    force    structure.      The   information   provided    includes; 

•  Assumptions. 

•  Notation. 

•  Steps    to    generate   the    Jaaes-Stein   attrition   rates. 
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ssE  =  EEE:-^'.;(')  -  -v,;  -A; 

I  =  1;  -  11  =  1 

SSB   =  SST  -  SSE 


STEP  4:  Compute  the  set  of  James-Stein  estimators  in  the  transformed  scale. 


C   =   1 


T(K  -3)       SSE 
A'(r-l)-2     SSB 


X  ~  C(X.,-X) 
unde  f  ined 


if   D„   =   1 


othiruise 


STEP  5;  Invert  the  transform  to  produce  the  attrition  rales  r 
/ 


1  ,  P 

—  1  -  sm( 0.5) 

2  n.. 


if 


2       n 


tf    -iL,^ 
2       n., 

tf   ^,-^- 
2        n„ 


JL    JL 
2  "    2 

2 


".;    =    "tE".;  {' 
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.4.    SIMPLE  STOCHASTIC  MODEL  EOR  CE\TRAL  DA  TA 

This  section  contains  the  nomenclature  and  characteristics  of  a  simple  stochastic  model  for 
central  data.    It  deals  with  the  joint  distribution  of  cell  inventory  and  number  of  attritions. 

The  following  nomenclature  will  be  defined  in  this  section  as  follows: 
N  =  Inventory  during  period,  which  is  distributed  Poisson  (  A  )  . 
Y  =  Number  of  losses. 

m  =  Central  attrition  rate  =    —    . 

S 

The  conditional  probability  of  losses  given  inventory  is; 

/  [y  :   -V  =  n)  =     (    ^   )     0"    (   1   -     e    )"  -  "  y    =  0 n 

The  joint  probabilitv  function  is: 

/(y. ")-/(«/      ")g(n)=      (^)      ev    (  1   -     0    )"-.    e-    ^    -^  0  ^   y    ^   n    ^  oc 

Let 


h{y]  -^    P    Y  =  y    =     Y.   f  [y.n 


e 


A  e 


a  =  1/ 


n    -   y    )\ 


1   -     6    )  A     " 


(  1  -    0    ) 


A  " 


Let  z   -- 

'-     n    -   y 

{  X  &y 

y! 

(A    0)" 

A  e 


0    )  A    '' 


Therefore  ,  ^    is     Poisson    (     A    0    )      .  which  is  the  marginal  distribution. 
Now.  the  estimated  value  for  the  central  attrition  rate  is: 

Em     --   ^'    -\-     -   E  Ei  ~      .\]    ^   E    -^  E(Y     y )  -  E    —  .\    0     =     0 
The  conditional  probability  of  inventor\   given  losses  is: 

f  'y  ■"  ) 


P{N^n       y^y)  =    g(r,      y 


^'ly) 


C) 


0 


0 


;      I     0      A     )^ 


A  "• 


Ti    ^:  cc 


I   -     0 


A  " 
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B.    SLPPOR  TING  DIS TRIE UTIO\S 

The  following  sections  of  this  appendix  contain  a  series  of  proofs  to  the  following  statements 
based  on  the  assumption:  Y  is  distributed  Normal  (0./)  .  v\here  I  is  the  Identity  matrix 

i)    U  =  — is  distributed  Normal  (         0        .1)  . 

...  _     ..      ere        ,,    .,      0 .    ,.    ..      . . ,       ,  ,^  ,        ©0'     . 

11)    Z  =   I  -  - — X  =    ^  ~  ^~ ~  IS  distributed  -Normal  (0,/ 


,    ,  0.    i  -  1    ,  0i    i  I    I  0 

See  section  C  for  the  following  statements: 

iii)    V  =    I    !   Z  j    I      is  distributed  X{k^i]  ■ 

iv 

PROOFS. 


w  =  u-^  v  -  I  !  r  I  :  2 


i)    U  is  independent  of  all  components  of  Z.   In  other  words  all  Cov  (U  ,Z  )  -  0  .    Proof  is 
as  follows. 

Cor(r.Z)=    Cox[U.Y ^-^ )=    Cov[l-,Y] ^ Cov{U,U) 

Now,  CoviU.Y)  =    Cov{       ®    ^       ,y)  -   —^ ^e,Cov(Y,Y)  ® 


0     ;  !    :  0         ;Ti  '     >•      '  0 

Therefore.  Cov{C.Z)  -   1 1 -  0.    Q  ED. 

ii)    The  distribution  for  Z  is  Normal  and  the  parameters  are  shown  as  follows. 

z,  -  >>$]•    ■•    ' 


;^.  0  ' 


£(Z)-£(y-) ^r(n-0--^|^  =  o 

£(ZZ  ')-(£Z)(£Z)'  =  £(ZZ  ')-0  =   E(ZZ  ')  =   Cot  (Z  .Z 
Coi{Z  .Z)  --    Coi{Y — —  .Y  ^®       ' 


0  '  0 

Cor  (  }  .  }   )-  Cov  ( .  }   ]-Coi  (  }   . )-  (Of 


0  '         '  e      '         '       @      '      e 

Now    Tot  (  r  .  }   )   =    /  .  H  here  I  is  ihe  identity  matrix  since  ^    i:-  distributed  Normal  (0.  /  )  . 
Also. 

„    ,      re  /-0  re     ,,      re     ,      ^,      re     .   ^,      re 

Cot . )    --    £   : : -   £    ■ 


0  0  6  0  '      i=)      '     '      e 

0  e 

Eir]  — ^ — Eir 


-     F  i 

re 

r© 

•f^  \ 

6          ' 

'       0 

^    E  - 

re      , 

0 

le 

e 

=   E{1 

n                  00- 

\      P\(-\ ' 

e 

=  £(r 

,      ee 

GsP> 

'       e 

=  li- 

0     '•)■ 

0©' 

0  '  0 


00' 
0 
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C    DISTRIBUTION  "V' 

This  section   is  an  continuation   of  section    B  and   shows    1=7'/=  Z         "is  distn- 

K  -1 

buted  \[\-i]  =     Y^  yormal  (O.l)-. 
I 

PROOFS. 

iii)    First,  construct  an  orthonormal  transformation  P,  where  P  is  K  by  K  dimensional  and 
P    P  =  I ,  where  I  is  the  identity  matrix. 

Let  Pk  '  be  the  last  row  of  P  and  be  defined  by 

0 


Pk    = 


6 


I    !  P/f  i    !  '  -   1 

Note  that  this  can  be  done  because  an  orthonormal  tranformation  can  always  be  completed. 
Now  let  .Y    =   PZ  and  calculate 

■^K    =  Pk  'Z 


1  =0 


0 


0 


S  ©;(>■; 


ue. 


;=C 


0 
1 


E©;^-; 


0       ; 

r 


0 


;-0 


['  © 


0 


;  -0 


_r 0_ 

0 


Therpfore  A^    =  0  random  variable,  and  the  space  is  K-1  \  ice  K  dimensional 
Ne.xt  calculate 

Cot  [  .V  .  .V  )  -    Cov  {PZ  .PZ)  =   E{PZ  ](PZ)' 
=   E  ( PZZ  -P  ■)  ^   P  E[  ZZ  ■  )  P  ' 

00 


=   P(Cot  \Z  .Z]]P  '  -   P  [I  - 


w 


pp  ■  -p- 


00- 

0 


P  -   --    IP   p.     p,    '  P 


P  Pk    Pk      P  ' 


0    0 

0    1 


KzK 


Therefore. 
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D.    yON- CE.\ TRA L  CHI  SQUARE 

This   section   considers    the    Non-Central    Chi    Square    with    n    degrees   of  freedom.     Letting 
Y  I,  .  .  .  ,  Y„     be  iid  Sormal    (  0,1  ),    then     \^\^  -    J]  ^  <^    ^'"^^  density; 


/.(") 


2'r(-) 


—  1 
u'        e 


0  <   n    <  oc 


But  if  the  means  are  not  all  zero  then  the  result  is  the  Non-Central  Chi  Square  random  vari- 
able. 

i)      With  n  =  l  and  letting  Y  be  distributed    Sormal    (0,1)    and      V   =    Y',  then  the  density 
of  V  is  as  follows 


y     ~    V  y    -    rr\  i' 


dv 


f  v{^)  ^     f  y{\'^'   )  +  f  y{-  vt^   ) 


1  1 


2\  V     \  '2n 
1 


c- 


—    V  c    -  e 


1 

d  1    v' 

^       t'^                     ^ 

dv 

2                   2v  f 

1 

0  <  t;    <  oc 

2\V 

,3                 -    ± 

-  ^V  - e  A 

—    e        "  ; 


>\   V      \  -111 


1  1 


2v  y     \  'In 


e      -e       -    le^^'     -   e-^ 


1  ~  "^      1  "  T  — 

-  — =-  e      "  Cosh  (GK  t' 


27r 


\  t' 


e2 


e  e         ^  >  , 


\  27r 


\    I'    n  =  0 


n  '     (2n 


I       (0^    p 


1         -  T    1     v^ 


f>-  .„   2"n' 


V  2>T  \     !■         _n        n 


n  -C 


'  •■>  (•', 


Whe 


A  - 


0- 


-  L^n(^ 


t'         "  e       ' 


n 


2-T       (2r( 


p.  (A 


1  A" 


117 


E     DISTRIBUTIOS    of  W  and  EXPECT  A  TIO.X  of  H" ' 

This  section  illustrates  the  characteristics  of  the  distribution  "\V".  which  is  the  convolution 
of  /  ,-2(«  )  (non-central  chi  square)  and  the  densit\  of  \("^-  -j)  =  f  h  -\i*^  )  ^^  ^^  noted  that  W  is  a 
non-central  chi  square  random  variable  vMth  K  degrees  of  freedom  and  non-centrality  parameter 
A. 

00 

/«-(»')=       J       f  ^,2(^    -V]    f  K-l    i^)    dv 

0 

-         f       t     Pn     (A)     /,„_,(u'-t)     /^..    (t)    dv 
0        n  =0 

=        E     Pn     (A)      J       /2n-.(u-f)     /a-,    (t    )    <it 
n  =0  0 

=      E   /'.   (A)   /,„.A-    ("  ) 


Desire  to  derive 


>         1 
E     ^n     (A)        r      -    /2.-/^(<^)^t 


1 

Let    r    -    '2n  -  h 


1  T     '         ~ 

Now   f,{u)   -    u  "      e       ^ 


•2-r(-) 

2 
1  "  1  >      1       -  - '    -  - 

^4-        =         E     ^n     (A)    -^ f       -    "    '  ^         '       '^"' 

2-r(-) 


E  P^  (A)  -^-^ —  /  "■ 


tfu' 


:-r(^,   ' 


Lp'    k    -      —     .  then   du    -     —     du 


^'     -^      -      tPniM-T^ j{-2u]'\^2du 


1  =  C  T  ^     r  0 

2-r(-; 
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F.    LOSS  A.\D  RISK  FUNCTIO.\S 

This  section  evaluates  the  Loss  and  Risk  functions.  Considering  the  column  vector  Y.  which 
is  distributed  Normal  (©./).  these  functions  are  as  follows. 

LOSS   =   L(a  .e)  ^   [a  -e)(a  -e)'  =  a-e        2 

Risk  is  e.xpected  loss,  and  for  the  ma.ximum  likelihood  estimator,  a  =  Y  and 

RISK  =  E  [  L  (Y  ,e)  '  =  E  (Y  -  e)-  {¥  -  Q)  =  E  (         V  -  0     ]  ')  =  K 

K 

Consider   another   function   of  Y      Use  }'  ;      ^  -    Y  '  Y  =    ^^  Y,^  which    is   distributed 

;=o 
Non  Central  \\\  ]■    See  section  D  for  the  properties  of  the  Non  Central  Chi  Square  Distribution. 
The  James-Stein  estimator  has  structure: 

a{Y)  =   (1 ^ )Y 

y 

It's  Risk  function  is  derived  as  follows: 
RISK  =   E'        (1 )   }■-  01    ^i 


-E       y-e     .      "■ 


Y         ■ 

=  £(>■_  0 ^1__  )  -  (  y  -  0 ^2— 

.  £  (V-  0)       ^L_    -    (y.  0)  ''>' 


2 


>      -                         }■      - 
-  E  ( y  -  0)  ■  ( >'  -  ©I ^. — T  r  -  (  y  -  0) ^ — ^  (>'-©)'  y ^- — -  y  y 

-   £•         y     0        -2 ? -Y-{Y-Q]  ^'  '■         " 


Y         '  Y 

=  E  Y-e        -  -2£ -Y-{Y-e)'b-E 


^   K  '-IE  : -}•'(}■-  0)  -6 -£         ' 


XiKil-^i 


Let 

EVAL   =   E 


Y  -0!    y 


=  .-      e      f(±i 


where  I    and  \\    are  defined  in  section  B  of  this  appendix, 
tion  G  in  t 
the  following 


See  section  G  in  this  appfodi.x  for  an  ►'valuation  nf  Ei 1        L  sins  that  result   now  leads  to 


121 


G.  EVALUATION  OF  E  {  —  ) 


r 


The  goal  of  this  section  is  to  evaluate    E  (  — -  ) 

First,  find  the  joint  distribution  of  L   and  W. 

fu.vi^  .1'  )  =    /f(«  )f  v(v) 

1 


1 


(u-  !  '  ei 


K  -  S 


\  27r 


A"  -   1 

2   -    ri 


K  -  1 


0<t;<oo        — oc<u,oo 


Change  of  variables: 


dt 


ti    —    u'  ~   V    ,  t    =    tr  —  W  ,         — —  =    1 

du' 


0<i<oc        =>         0<u-u     <oc        =>         u'  <  w    <  oc 

1 


/  f  ,  w-  ( "  • "'  )  =   — ^=-  c 


1  -  -lu-       e.  ,  )• 


A-   1 

"^""r.,     A    -      1 


A-    S  1  ,  J, 

2\        2  2 

u,'  ~  u  e 


2  '  n 


cx;<u<oc  u<u;<cx; 


Since      —  —  (u  —  0 

2    ^ 


/r,n  ("  ■"■ 


-(u^-2u  ,       0|    !  ^    i    1  0      ,  -)    it  follows  that 

2  I.I.  '     ; 


A  -    1 


2n  2     -       Tl 


K  -   1 


a  —  u  ' )      '       e 


x-^         ±„-.2u       ei 


Therefore, 


^'T 


—      e 


A  -    1 

2^2  *    n 


5C  14 

e 


I 


A'  -    1  ,     0 


-^  —  u 


e  du 


dw 


\^  here  the  range  is         u'  <   u  -  >         — \    u      <'   u    <  v  « 

Let  <    =   — =■  .  dt    =   — =-  .  and  -   1  <  f    <  -1.    Continuinf 


\     U' 


\   u 


—         B 


X  u 


^'Tf  1  = 


A  -   1 

2^  2  -    n 


K       1 


1 

r     ' 

A  -    3 
U 

I- 

t') 

A  -  3 

e 

6: 

•^  "    \    u   dt 

\    \    u 

du 


X  u 


A  -    3     1 


A  -   3 


A  -    1 


\    ii 


'>:r    9 


A'  -     1    ,       0 


X'"  =  —  / 


Mi-r) 


e      ( V  u 


(i< 


Now 


e     tvt 


0 


t'    U' 
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^(  — )  =  - 


K  -   I 


V2n-2 


J  =0 


0 


n  (2'-i) 


^;  -I 


2'    {j 


o;  -1 


n  (2'-': 


J  -■ 


Since  r(  —  )  =  \7r     and  cancelling: 


-  -;  i  e    I  '  ^  /=vi      li  -I 

£(  ^  )=   e      ^ 


U_ 

W 


E 


;=0  2^-';'  (j-4 


^(i)=i,ei       E/.(A)^ 


where    A  = 


;=0 
0  ' 
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/    MOXTHLY  VERSES  YEARLY  RATES 

This  section  provides  a  recommended   method  to  concert   yearly   attrition  rates  to  monthly 
rates. 

Let    r    =      yearly  attrition  rates. 

Let  r^    -     monthly  attrition  rates. 

.Assume  independence  from  month  to  month.    Thus  12  consecutive  survivor  months  implies 
a  survivor  year. 

1   -   r    =    (1   -   r^)'' 

1 
(1  -    r)i-    =    1   -   r^ 

1 
r.    =    1   -   (1   -   r)>^ 

Thus,  the  abo\e  equation  relates  the  yearly  attrition  rates  to  the  monthly  attrition  rates. 
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MSSCOUN       JCL     A 
//COUNTS  JOB  (2209. 5555), 'MAJ  D.  D.  TUCKER', CLASS  =  B 
//^■MAIN  0RG  =  NPGVM1  2209P 

//♦     t******^**r************  NOTE  TO  USERS  ♦♦♦♦♦♦♦■»•♦»♦■»♦♦♦♦•<'♦*»♦•♦ 
//♦       THIS  WILL  MOVE  THE  DATA  SET  COUNTS  FROM  MASS  STORAGE  TO  THE 
//*       MVS004  DISK.     FIRST  CHANGE  THE  XXXX'S  TO  YOUR  USER   ID. 
//♦       CHANGE  YYYY  TO  YOUR  PROJECT  NUMBER.  THEN  SUBMIT  THE  PROGRAM 
//*       TO  MVS.     WHEN  THE  SYSTEM  RETURNS  A  COMPLETION  MESSAGE.  USE 
//*       THE  GETMVS  SYSTEM  EXEC  TO  COPY  THE  DATA  SET  TO  YOUR  A-DISK 
//*       IF  YOU  WISH  TO  ACCESS  IT  FROM  CMS.     BE  ADVISED  THAT  THIS 
//♦       FILE  IS  HUGE  AND  WILL  TAKE  UP  NEARLY  4  CYLINDERS 

//* 

//♦  USE  DSN=   S2209  COUNTS         IN   GETMVS 

//  EXEC   PGM  =  IEBGENER 

//SYSPRINT  DO  SYSOUT=A 

//SYSIN  DO   DUMMY 

//SYSUTl    DD  DSN  =  MSS.F0968. COUNTS. DISP=OLD 

//SYSUT2       DD  D5N  =  S2209  COUNTS,VOL  =  SER  =  MVS004.UNIT  =  3350. 

//  SPACE  =  (CYL.(2.2).RLSE).DISP  =  (NEW.KEEP) 

/» 


Figure    D.I         MSSCOOM    JCL    A1. 

B.        DATA    MANIPOLATIOM    PBOGRAHS 

A  ttiChnique  utilized  Ly  tne  author  to  reduce  the  depen- 
dency or  manipulatixig  sach  a  large  data  iiie,  involved 
creatinq  a  rile  of  indices  corresponding  to  pertinent  data 
in    thfc      original    summary    data    iilt.  A    set    or       indices    was 

created    for    the      inventories   and    losses    identified      ror    each 
fiscal    year. 

1  •      iii2§Il torx    Indices 

The   set      indices    for      tne    inventory      can   ce      created 
using    the    following    procedures. 

•    Separate    the    suirniary    data    file      by    fiscal    year    data   and 
create    a    new    file    tor      each    riscal    year.  For    example, 

•'CCUUI77      DATA    C ••      ror       fiscal    year       1977       data    on      tne 
tfcuporary   C-disk. 
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TABLE    11 
RECOHDS    PER    FISCAL    lEAB 


Fiscal   Y 

1^77 

1978 

19  79 

1980 

1981 

1982 

1983 

Year 


Number  ol    Records 

2203 
2231 
2337 
2351 
^317 
2324 
2330 


FILE) 


NUMBER   EXEC   Al 


STRACE  OFF 

*  TO  EXECUTE  THIS  EXEC    TYPE   "NUMBER  <YEAR>" 
X      NHERE  YEAR  IS  ONE  OF  77-85. 

GLOBAL  TXTLIB  VALTLIB  VFORTLIB  IMSLSP  NONIMSL  CMSLIB 
GLOBAL  LOADLIB  VFLODLIB 
FORTVS  INV  (LVL(77)) 

*  FORTVS  &1  (LVL(77)  ) 
CP  TERM  LINESIZE  133 

SA  =  &CONCAT  OF  COUNT  SI 

SB  =  SCONCAT  OF  INV   SI 

FILEDEF  08  DISK  SA  DATA  C  (RECFM  F  LRECL  53  PERM 

FILEDEF  09  DISK  SB  ARRAY  C  (RECFM  F  LRECL  9  PERM   " 

STRACE  ON 

LOAD  INV  (START 

X  LOAD  SI  (START 

ERASE  INV  LISTING 

ERASE  INV  TEXT 

ERASE  LOAD  MAP 

SEXIT 


Figure    D.  3         NUMBER    EXEC    Al. 


13  1 


•  Ins  CAS  file  "INVXX  ARBAY  C",  can  be  read  into  an  APL 
workspace  and  assigned  to  a  numeric  variable  by 
executing  the  user  friendly  "CMSAPL  EXEC  A1".  See 
Figures  D.5  and  D.6.  Note  that  "CMSAPL"  has  a  provi- 
sion to  add  a  variable  to  an  existing  APL  workspace  or 
to  create  a  new  workspace  each  time  "CMSAPL"  is 
executed. 

The  file  of  indices  created  in  this  manner  can  be 
utilized  to  generate  matrices  in  the  APL  workspace.  The  two 
dimensional  array  for  each  fiscal  year's  inventory  is  cnar- 
acterized  by  a  length  subject  to  the  number  of  records  in 
the  summary  data  file  for  a  particular  fiscal  year  in  waich 
there  is  a  nonzero  inventory  count.  The  columns  of  the 
records  in  the  index  file  is  described  in  Table  XXI 

2  .   loss  Inaices 

Tne  set  indices  for  the  losses  can  be  created  in  a 
manner  similar  to  the  indices  for  the  inventory  by  using  the 
following  procedures, 

•  Separate  the  summary  data  file  by  fiscal  year  data  and 
create  a  new  file  for  each  fiscal  year.  For  example, 
"L0SS77  DATA  C"  for  fiscal  year  1977  data  on  the  tempo- 
rary C-disk. 

•  Execute  the  "LOSS  EXEC  A1"  cy  typing  "LOSS  <XX>"  where 
XX  is  the  desired  fiscal  year  (such  as  77  for  fiscal 
year  1977  data).  See  Figure  D.7.  This  executive  rfill 
load  the  Fortran  program  "LOSSES  FORTRAN  Al"  wnich 
reads  the  data  from  the  file  "C0UNI77  DATA  C".  See 
Figure  D.8. 

•  The  output  from  "LOSSES  FORTRAN  Al"  will  he  sent  to  a 
CMS  file  "LOSSXX  ARRAY  C",  wnere  XX  is  the  desired 
fiscal  year  (same  as  above). 

•  Ihe  CMS  file  "LCSSXX  ARRAY  C",  can  be  read  into  an  APL 
workspace  using  similar  methods  as  tne  inventory  tile 
described  above. 
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iSTACK  N 

3cSTACK    )WSID    MYSPACE 

ArSTACK    )SA\T: 

iSTACK    )OFF    HOLD 

EXEC  APL 

-SKIP   &TYPE    ■''''''>      IS   THIS   A   NEW   WORKSPACE''     Y  OR    N 

ArREAD  VAR   3cNEW 

iclF  .ScNEW   EQ  Y  iCOTO    -JUMP 

CP   TERMINAL  APL  ON 

&STACK   )LOAD  iWKS 

.ScSTACK   )PCOPY   MYSPACE    &;A 

&STACK   )SAVi: 

icSTACK   )DROP    MYSPACE 

ASTACK  )OFF   HOLD 

EXEC  APL 

&EXIT   98 

-JUMP   CP   TERMINAL   APL   ON 
ScSTACK   )WSID  iWKS 
&STACK   )PCOPY   m'SPACE    icA 

&STACK   )SAVE 

.ScSTACK    )DROP    MYSPACE 

iSTACK    jOFF    HOLD 

EXEC  APL 

iEXIT   99 

-TELL  JcTYPE  YOU    HAVE    ENTERED   TOO   NUiJY  OR    NOT   ENOUGH   ENTRIES 

iTYPE    FOR    THE    FILE    THAT   YOU   W.ANT    TO    BE    READ    INTO   APL. 

iTYPE  YOU   NEED  TO   BEGIN  AGAIN 

.iTYPE 

iTYPE    ENTER:  CMSAPL 

iEXIT    100 

-ERROR   iTYPE   ici.    &2   &3   DOES    MOT   EXIS   ON  YOUR   iCONCAT   OF   ic3    -DISK 

iTYPE   CHECK  YOUR    FLIST   AND   THEN    BEGIN    AGAIN 

&TYPE    BY  ENTERING  CMSAPL 

,3cEXIT    IQI 


I 


figure  D.6    CHSAPL  EIEC  A1  (CONT.) . 
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i 
c 

PROGRAM  LOSSES 

T 

c 

***wW*****ir^***1r*w**»-V******»****-****-*r-T*T»-V'r»»*r'<r 

W  *  *  T  * 

c 

***  REFORMATS  DATA  FROM  'COUNTS'  FILE  INTO  A  TWO 

T  •  * 

c 

***  DIMENTIOMAL  ARRAY  OF  LOSSES   FOR  EACH  YEAR. 

W  *  • 

c 

***  THE  OUTPUT  IS  A  SET  OF  INDICES  FOR  MATRIX 

*  *  * 

c 

***  FORMATION  IN  APL. 

#  ♦  • 

c 

*  *  * 

»  *  * 

c 

***               INDEX  SUMMARY 

W  »  W 

c 

*'*   COL    CHAR            INFO 

*  ^  * 

c 

*'*    1      1      YEAR  (  1, 2,  .  .  . , 7}  =  (77, 73 33)  '-' 

c 

***   2,3     2      MOS  (_6o,0l 39) 

**'    4      1      GRADE  (6, 1, . . . ,9) 

TT  *  * 

c 

*  IT  * 

c 

***   5,6     2      LOS  (0,  1 36) 

***  8, §,10   3      NUMBER  OF  LOSSES 

*  *  * 

c 

if  it  -k 

c 

*  *  vr 

*  «  * 

c 
c 

*«*       USE  "LOSS  <YEAR>"      EXECUTIVE 

*  *  * 

•  ********«*******«**#-W'Wwi«'*w****'*r»w»rir****«f-w»ir*'W******» 

c 
c 

INTEGER  GRADE, MOS,  YEAR.  LOSS,  LOS,  lYR 

THE  DO  LOOPS  CORRESPOND  TO  THE  NUMBER  OF  RECORDS 

FOR  '. 

'EARS  77-83 

c 
c 
c 

THE  FIRST  ONE  IS  THE  FULL  DATA  SET,  THE  SECOND  IS  FOR 

TESTING 

DO  200  1=1, 16093 

c 

DO  200  1=1,50 

c 

DO  200  1=1,2203 

c 

DO  200  1=1,2231 

c 

DO  200  1=1, 2337 

c 

DO  200  1=1,2351 

c 

DO  200  1=1,2317 

c 

DO  200  1=1,2324 

c 
c 

c 

101 

DO  200  1=1, 2330 

**  LI  -  LB  IS  LOSS  TYPE  CODE  1-8 

READ ( 8, 60) YEAR ^ MOS, GRADE, LOS, INV, LI , L2 , L3 , L4 , L5 , 
FORMAT  (412,91^) 

L6,  L7 

LB 

60 

c 
c 
c 

'**  INITIALIZE  THE  INDEX   1  =  77,2  =  78 7=83 

lYR  =  YEAR  -  76 

c 

LOSS  =  100000000* lYR 

IF(L1.GT.0 

CALL  SUM  IN, LOSS, MOS, GRADE, LOS, 1, LI) 

IF(L2.GT.O 

CALL  SUM  IN,  LOSS,MOS,GRA.DE,  LCS,2,  L2 

IF  L3 .GT.O 

CALL  Siml IN, LOSS. MOS, GRADE. LOS, 3,  L3 

IF  L4.GT.0 

CALL  SUM  IN, LOSS, MOS, GRADE, LC3, 4, L4 

IF  L5. GT.O 

CALL  3UM( IN, LOSS , MOS , GRADE , LC3, 5, L5 

IF  L5. GT.O 

CALL  SUM( IN, LOSS, MOS, GRADE. LOS, 6, L6 

IF  L7. GT.O 

CALL  SUM  IN,  LOSS,  MOS,  GRADE,  LOS,  7,  l:- 

IF(L3.GT.O 

CALL  SUM  IN. LOSS, MOS, GRADE, LOS, a, L3 

200 

CONTINUE              '   " 
STOP 

c 

END 

c 
c 

THIS  SUBROUTINE  CREATES  THE  INDEX  ARRAY  FOR  LOSS 

3  DATA 

ELEMENT 

SUBROUTINE  SUM( INDEX, I, J.K, L,M,NUM) 
INTEGER  INDEX, 1 , J^K^ L^M , NUM 

INDEX=  I*(  J*ic606060  t-(K*100C0QO)^(L*10OOG)* 
WRITE  (9,500)  INDEX 

M-1000)*NUM       1 

500 

FORMAT  (110) 
RETURN 

E:ro 

Figure    D.8         LOSSES    FORTRAN    Al. 

Ihe  file  of  indices  for  tne  losses  is  siailar  to  the 
inventory  file.  The  two  aimensionai  array  Eor  eacn  fiscal 
year's  loss  is  characteri  zea  by  a  leut^th  suLJect  to  the 
number  cf  records  in  the  summary  data  rile  for  a  particular 
fiscal  year  in  whicu  there  is  a  nonzero  loss  count  for  each 
loss    type-       It    snould    be    i.ottd    that    a    record    in    the    original 
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d.   Function  GETINV 

The  fanction  to  create  the  inventory  iLdtLix  ror 
each  desired  year  is  "GETINV".  See  figure  D. 9.  "GETINV" 
utilizes  the  function  "INYMATX  INVXX",  whicn  interprets  the 
inventory  index  matrix  "INVXX"  and  creates  the  inventory 
matrix  for  riscai  year  "XX".  See  Figure  D.10.  The  resul- 
tant matrix  is  "IXX"  for  fiscal  year  "XX".  The  function 
"INVMATX"  could  create  a  matrix  of  the  following  dimension 
7xUOx10xJl  lor  7  years,  ^0  MOS's,  10  grades,  and  31  LOS's. 
However,  due  to  limited  workspace,  the  dimension  of  UOxilxlO 
for  40  ."lOS's,  31  LOS's,  and  10  grades  was  commonly  utilized. 


V  GETINV 
[1]     0  GET  THE  NUMBER  OF  YEARS  OF  INVENTORY  DESIRED.  j 

[2]     «  COMMENT  OUT  YEARS  NOT  USED  IMMEDIATELY.  I 

[3]       177-INVMATX   irJV77  J 

[4]       I7S^INVMATX   irslV78 
[5]       I79*-INVMATX  INV79 
[6]       laO^INVMATX  irjvso 
[7]       131  -INVMATX   irJVSI 
[8]     «  182-INVMATX  INV32 
[9]     0  I83^INVMATX  INV83 
[10]     '  SHAPE  OF  177  IS  ' 
[n]     D-pl77 


Figure  D.9   APL  Function  GETINV, 


t.   Group  Inventory  Functions 

There  is  a  separate  function  to  make  a  combined 
matrix  for  each  of  tne  MOS  groups.  Listed  in  Table  XXIII 
are  the  groups,  the  function  used  to  create  the  desired 
matrix,  and  the  shape  of  the  resultant  matrix.  Figure  D. 1 1 
is  d  representation  of  the  programing  technigue  used  ror  the 


139 


V  GETGC;K 

[I]  0  THIS  GETS  THE  INVENTORY  MATRIX  FOR  THE  GROUND 
[2]     A  COMBAT  OCCUPATION  GROUP. 

[3]     fl  SHAPE  IS  (MOS;YEARS.LOS;GRADE) 

[4]     «   ♦.  CHECK  THE  2ND  INDEX   FOR  THE  CORRECT 

[5]     0   ♦♦  NUMBER  OF  VEARS   DESIRED. 

[6]       GC*-(3  4  31    I0)p0 

[7]     Q       THIS  GETS  THE  MOS:  INFANTRY 

[a]       GC[1..,]-GETM0S  3 

[9]     «       THIS  GETS  THE  MOS:   ARTILLERY 

[10]     GC[2;::]^GETM0S  5 

[II]  a       THIS  GETS  THE   MOS:   TANKS  AND  AMPHIB 
[12]     GC[3,..]-GETM0S   10 


Figure    D.11         APL    Function   GETGC. 


[1] 
[2] 
[3] 
[-^1 
[5] 
[6] 
[7] 
[S] 
[9] 
[10] 

[11] 
[12] 


V    Z-GETMOS   X,J;Y 
ft  GET  THE  CENTRAL  DATA  NUMBER  OF  INVENTORY 
ft  FOR  THE  >EARS  DESIRED  FOR  A  PARTICULAR  MOS. 
ft  X=   MOS  DESIRED.     CHANGE  THE  irJDEX  FOR   2  IF 
ft  YEARS  ARE  NOT  SEQUErJTiAL. 

J*-X-l-1 

Z-(4  31    10)pO 


z[i 

Z[2 

Z[3 

Z[4 

«Z[5 

9Z[6 


;]^((Y-(I77[J;,]  +  I78[J 
,]-((Y-(l78[j;,]  +  l79[j 
.]-((Y-(l79[j;;]  +  l80[J 
,]-((Y-(l80[j.;]  +  lSl[J 
;]-((Y-(l8l[J.,]  +  i82[J 
;]-((r-(l8:[J..]  +  l83[j 


;])-2)f(L77[j;;])) 
;])-2)[(L78[j;;])) 
.])-2)[(L79[J:;]1) 
:])-2)r(LS0[j;,])) 
;])-2)[(L3l[J;,])) 
.])-2)[(L82[J;,])) 


Figure    D.12        APL    Function   GETMOS. 

To      create   the      inventory      matrix      used    ior      the 
Jdmes-Stem    runction    the    roiiowinvj    tunctioriS    we  te    used: 

•  GEING  -  creates  the  "NG"  matrix  tor  GroaiiQ  Comtat.  See 
Figure  D.13  for  an  exaaple  of  the  program  utilized  to 
create    the      the    Ground      ComDat    inventory       matrix.         The 
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a.      Function    GETL0S3 

The    fanctj.cn    to   create    the      loss   aatirix    for    each 
desired    year      is    "GETIOSS".  See    Fi^jure      D.14.         "GETLOSS" 

utilizes  the  function  "MATRIX  LOSSXX",  khicn  interprets  tue 
loss  index  matrix  "LCSiiXX"  and  creates  the  loss  matrix  for 
fiscal  /ear  "XX".  The  resultant  matrix  is  "LXX"  tor  fiscal 
year  "XX".  The  function  "MAIFIX"  could  create  a  matrix  of 
the  fclloking  dimeiisicn  7x40x10x31x8  for  7  years,  40  M05's, 
10  grades,  31  LOS's,  and  8  loss  types.  Hoyever,  due  to 
limited  workspact,  the  dimension  of  40x31x10  for  40  MO^'s, 
31  LCS's,  and  10  grades  was  commonly  utilized  when  the 
losses    were   aggregated.  Each    loss    typt    could      be    examined 

witn    the   saiLe    dimensicced    matrix.       See    Figure    D.15. 


V  GETLOSS 

[1]  «  THIS  GETS  THE  L(XX)  MATRIX  FOR  YEARS 

[2]  a  XX  =  77,...,83. 

[3]  L77-MATRIX   LOSS77 

[4]  L78-MATRIX   LOSS78 

[5]  L79-MATRIX   LOSS79 

[6]  L80-MATRIX  LOSS80 

[7]  «L81 -MATRIX  L0SS31 

[8]  «L82*-MATRIX  LOSS32 

[9]  OL83-MATRIX  LOSS83 


figure   D.14        APL    Function   GETLOSS, 


b.       Group    Loss    Functions 

There  is  a  separate  function  to  uiaite  a  coibined 
matrix  tor  tach  of  tne  M05  groups.  Listed  in  rarie  XXIV  are 
tue  groups,  the  function  used  to  create  tne  desired  matrix, 
and    the    shape    of    tne    resultant    matrix.         See    Figure    D.lfc    for 
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the    Ground      Combat   function. 
were    siailar. 


The    other      groups*    functions 


• 

V  GCLOSS 

[1] 

a     THIS  GETS  THE  LOSSES  FOR  GROUND  COMBAT 

[2] 

0     FOR  6  YEARS  77   -   82. 

[3] 

B     SHAPE  IS  (MOS, YEAR, LOS, GRADE) 

[4] 

0     CHANGE  YEAR  INDEX  TO  DESIRED  NLIMBER 

[5] 

GCL-(3  4  31    I0)p0 

[6; 

GCL[1;1;;]-L77[4.;] 

[7: 

GCL[1;2;;]-L78[4;;] 

[8] 

GCL[1;3;;]-L79[4.;] 

[9] 

GCL[1;4;,]-L80[4;;] 

[lo; 

fl  GCL[1,5,.]-L81[4;;] 

[n: 

«  GCL[1.6;.]-L82[4;.] 

[12; 

GCL[2,1„]-L77[6;,] 

[13; 

GCL[2;2;;]-L78[6;;] 

[14: 

GCL[2;3;:]-L79[6;:] 

[15: 

1     GCL[2;4;;]-L80[6::] 

[16: 

1  «  GCL[2.5,;]-L81[6;,] 

[17: 

0  GCL[2.6;,]-L82[5.;] 

[is: 

GCL[3;1;.]-L77[11.;] 

[19- 

GCL[3;2,,]-L78[11;;] 

[20. 

GCL[3;3;:]-L79[11;:] 

[2i: 

GCL[3;4;,]^L80[11;.] 

[22' 

fl  GCL[3,5;,]-L81[1  1;;] 

[23. 

fi  GCL[3;6;;]-L82[n;;] 

Figure    D.16        APL    Function    GCLOSS. 

These  functions  Uiie  the  gioDai  variables  "L.Ci"  , 
ror  eacii  "XX"  fiscal  year.  In  ordt^r  to  get  a  Coacat  Service 
Support  MOS  loss  matrix,  the  "LXX"  matrix  was  used  directly 
to    remove    tne    desired    comtination    of    cells. 

To  create  the  loss  aatrix  used  tor  the 
James-Stein    function    the    lolloping    functions    were    used: 

•  GEiYC    -    creates    the    "YC"    matrix    for    ComDat    Support. 

•  GEIYG    -    creates    the    "YG"    matrix    ror    Grouud    Combat.       See 
Figure    D.17    for    an   example. 
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•  let   K   =    number    of   feasible   cells,    i.e.,    sum   of    ail   Dij. 

•  Let    Yij   =   number    of   leavers   in   cell    (i,  j)  . 

•  Let  t   =    1,...,T;    wnere    I    =   number   of    years    of    data    used 
to    create  the  estimator. 

•  let        Nij        =        Inventory      in        cell         (i#j)  =        Max 
((N{t)  +N(t+l))/2,y(t))  . 

Ihe  foiiowing  API  functions  were  utilized  in  the 
James-Stein  model.  See  Figure  D.18  for  the  APL  listings  of 
the    fcllowing    functions. 

a.  Function    ARCSIN 

Let  ISM  =  Nij  APCSIN  PMMij.  The  function 
"ARCSIN"  returns  the  inverse  sine  transformation  for  use 
when  the  success  probability  is  estimated  directly  (e.g.,  by 
MINMAX)  . 

b.  Function    EINCONV 

Let  Pij  =  Jij  BINCONV  Mij(t).  The  function 
"BINCCNV"   inverts   the   inverse    sine   transformation. 

c.  Function    EINPREP 

Let  ISij(t)  =  Yij(t)  BINPREP  Ni j  (t)  .  The  func- 
tion "BINPREP"  prepares  the  Freeman-Tukey  version  of  the 
inverse  sine  transformation  for  binomial  data. 

d.  Function    JAiiES 

Let  Jij  =  Dij  JAMES  ISi j  (t)  .  The  function 
"JAMES"  returns  a  James-Stein  estimator  for  the  means  of  the 
means  of  the  cells  in  the  last  two  dimensions  of  the  input 
matrix  while  being  screened  by  tne  incidence  matrix  of 
nonstructural    zeroes. 

e.  Function    MINMAX 

Let    PMMij    =    Yij  (t)     MINMAX    Nij(t).         Tne    function 
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function  was  utilized  in  the  validation  procedures  to 
compare  the  James-Stein  estimator  to  the  MINMAX  estiJiator. 
See    fiyure    D.  19    for    the    AtL    listin^j   of    tnis    function. 


V   P-Y  MINMAX   N;YY 

[1]  «   RETURNS  MINMAX   EST'S  FOR   BINOMIAL 

[2]  fi   INCLUDES  REDUCTION  OVER  THE    1st  RANK   (TIME)  OF  THE 

[3]  «  INPUTS.     NN  (GLOBAL)  IS  CREATED  FOR  USE  IN  ARC  SINE. 

[4]  YY^+/Y 

[5]  NN^-t-/N 

[6]  P*-(0.5  +  (0?iYY)xYY-NN*0  5)-1+NN*0.5 


Figure  D.19    APL  Function  MINMAX, 


**  •   V^li^^tion  Functions 

The  following  APL  functions  are  an  example  of  the 
functions  utilized  in  the  validation  procedures  specified  in 
Chapter  5.  Each  MOS  group  (Aviation,  Combat  Support,  ComDat 
Service  Support,  and  Ground  Combat)  was  unique  in  its  set  of 
functions.  However,  the  methodology  was  similar  in  manipa- 
lating    the    data. 

a.       Function    BEFORE 

The  function  "BEFOfiE"  performed  the  preliminary 
work,  foi  the  parameters  lor  the  James-Stein,  Miniaax,  and 
Maximum    Likelihood  estimation    processes.       See    Figure    D.2). 

t.       Function    GETGC8  1 

The  function  "GETGCBl"  derives  the  Normal  values 
for  the  James-Stem,  Minimax,  and  Maximum  Likelihood  estima- 
tors.      See    Figure    D.21. 
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V  GETGC81,CS81.Y;N5,NS;Y5;YS;IG5,IG8 
>  GET5  THE  NORMALS  FOR  THE  JAMES-STEIN,  MINMAX. 
»  AND  MAXLIKE  FOR  ThiE  GROUND  COMBAT  MOS  FOR  THE 
»  PROJECTED  YEAR    1981.     ALSO  COMPUTES  THE  CHI 
»  SQUARE  VALUES  FOR  VALIDATION  PURPOSES. 

1-0 

N5-N81G5 

N8-N81G8 
Y5-Y81G5 

>8-Y3lG8 
IG5-(Y5  BINPREP  N5) 
IG8-(Y8  BINPREP  N3) 

NOR31G5-IG5-JG5 

NOR81GS-IG8-JG8 

DD-DLT 

P-PG5 

CSSlG-(2  6)pO 

CS81G[1;   1    2]-Y5  CHI  N5 

DD-DLC 

P-PG8 

CS81G[2:   1    2]-Y8  CHI  N3 

NMM81G5-IG5-IM5 

NMMS1G8-IG8-IM8 

DD'-DLT 

P-PM5 

CSS1G[1.  3  4]-Y5  CHI  N5 

DD-DLC 

P-PM8 

CS81G[2;  3  4]-Y8  CHI  N8 

NA81G5-IG5-IA5 

NA81GS-IG8-IA8 

DD-DLT 

P-PA5 

CS81G[1;  5  6]-Y5  CHI  N5 

DD-DLC 

P-PAS 

CS81G[2;  5  6]-Y8  CHI  N8 

JAMES  DF  MINMAX  DF  MAXLIKE 

D<-CS81G 


DF' 


Figure  D.21    APL  fanction  GETGC81 
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V  z« 

:i] 

>] 
:5] 
:6] 
:7] 
:8] 
>] 
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1 1 

12 

14 
^15 
16 
17 
18 
19 
20 
^^1 

"23 
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26 
"27 
^28 
"29 

30 
'31 

^2 
53 

^34 
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■37 
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■Y  TEST  N.A;B.C;DD,IS;J.P;MLE,AC;SSEM,SEM,SSEP,SEP.DIFP;DIFM,NUM 
a  Y  IS  LEAVERS.  SHAPE  (2  \EARS.  31   LOS.  MOS'S) 
«  D  IS  THE  GLOBAL  INCIDENT  MATRIX  FOR  THE  DESIRED  GRADE 
fl  N   IS  CENTRAL  INVENTORY,   SHAPE  (2  YEARS,   31    LOS,   MOS'S) 
IS^Y  BINPREP  N 
J-D  JAMES  IS 
P^(J  BINCONV  N)xD 
MLE-((Y[l;;])-(N[l;,]  +  (N[l,,]  =  0)))xD 
AC-((+A)-((  +  /N)  +  ((  +  /N)  =  0)))xD 
SSEM-+/,SEM-(AC-MLE)-2 
SSEP^+/,SEP-(AC-P)*2 
DIFP-+/.(SEP<SEM) 
DIFM-+/,(SEM<SEP) 
NUM*-+/,D 

1    THIS  IS  TEST   1.  ALL  THE  CELLS  IN  THE  FEASIBLE  REGION' 
ARE  ESTIMATED  BY  THE  JAMES  STEIN   PROCESS' 

SEP  =  THE  SQUARED  DIFFERENCE  OF  THE  JS  PROJECTED  AND  ACTUAL' 
SEM   =  THE  SQUARED  DIFFERENCE  OF  THE  MLE  AND  ACTUAL' 
SSE(MLE)   =  THE  SUM  OF  THE  SQUARED  DIFFERENCES  OF  SEM' 
SSE(P)   =  THE  SUM  OF  THE  SQUARED  DIFFERENCES  OF  SEP' 

FEASIBLE  CELLS     SEP<SEM     SEM<SEP     SSE(MLE)     SSE(P)' 
A^' 


C-' 

DD-'       ' 

B.(»NUM),A,(»01FP),B,(»DIFM),C,(»SSEM),DD.(»SSEP) 


SSEM-+/,SEM-(AC-(MLEx(AC;^0)))»2 

SSEP-+/,SEP-(AC-(P-<(AC^0)))«2 

DIFP»-+/.(SEP<SEM) 

DIFM-4-/,(SEM<SEP) 

rJUM^+/.(AC?i0) 

THIS  IS  TEST  2,   ONLY  THE  CELLS  IN  THE  ACTUAL  REGION    >0' 

ARE  REPRESENTED  IN  THE  COMPARISON  TEST  ' 

SEP2   =  THE  SQUARED  DIFFERENCE  OF  THE  JS  PROJECTED  AND  ACTUAL' 

SEM2  =  THE  SQUARED  DIFFERENCE  OF  THE  MLE  AUD  ACTUAL' 

SSE2(MLE)   =   THE  SUM  OF  THE  SQUARED  DIFFERENCES  OF  SEM2' 

SSE2(P)   =  THE  SUM  OF  THE  SQUARED  DIFFERENCES  OF  SEP2' 

FEASIBLE  CELLS     SEP2<SEM2     SEM2<SEP2     SSE2(MLE)     SSE2(,P)' 
A*-' 
B^' 
C^' 
DD-' 
B,(«NUM),A.(»DIFP).B,(»DIFM),C,('SSEM),DD.(»SSEP) 


Fiqure    D.2U         API    Function    TEST. 
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